POLYNUCLEOTIDES AND POLYPEPTIDES ASSOCIATED WITH
ANTIBIOTIC BIOSYNTHESIS AND USES THEREFOR 

FIELD OF THE INVENTION 

THIS INVENTION relates generally to antibiotic biosynthesis. More particularly,
the present invention relates to polyketides and the polyketide synthases and ancillary
enzymes that are capable of producing such compounds. Even more particularly, the
present invention relates to a polyketide synthase linked to a non-ribosomal peptide
synthetase involved in the biosynthesis of albicidins, to a phosphopantetheinyl transferase
for activating enzymes, particularly polyketide synthases and/or non-ribosomal peptide
synthetases, associated with the biosynthesis of albicidins, and to a methyltransferase for
methylating precursors of albicidins and/or intermediates related to albicidin biosynthesis.
The present invention also relates to biologically active fragments of the aforementioned
polypeptides and to variants and derivatives of these molecules. Further, the invention
relates to polynucleotides encoding the said polypeptides, including the xabA, xabB and
xabC genes of Xanthomonas albilineans, to polynucleotides encoding the said fragments,
variants or derivatives, to vectors comprising the said polynucleotides and to host cells
containing such vectors. The invention also relates to a transcriptional control element for
modulating the expression of polynucleotides including, for example, the xabB gene and/or
the xabC gene of Xanthomonas albilineans, or variants thereof. The invention also features
methods of using the polynucleotides, polypeptides, fragments, variants, derivatives and
vectors for activating polyketide synthases and/or non-ribosomal peptide synthetases, for
methylating precursors of albicidins or their analogues and/or intermediates involved in the
biosynthesis of albicidins or their analogues and for enhancing the level and/or functional
activity of albicidins or their analogues. The invention also encompasses methods of using
the aforesaid polynucleotides, polypeptides, fragments, variants and derivatives for the
biosynthesis of albicidins or analogues thereof.

Bibliographic details of various publications referred to by author in this 
specification are collected at the end of the description. 
BACKGROUND OF THE INVENTION

Polyketides represent a large, structurally diverse group of compounds synthesised
from 2-carbon units through a series of condensations and subsequent modifications. They
possess a broad range of biological activities including antibiotic and pharmacological
properties. For example, polyketides are represented by antibiotics such as tetracyclines
and erythromycins, immunosuppressants such as FK506, FK520 and rapamycin, anticancer
agents such as daunomycin and veterinary products such as monensin and avermectin.

Considering the difficulty in producing polyketide compounds by conventional
chemical methodologies, and the typically low production of polyketides in wild-type
cells, there has been considerable interest in finding improved or alternate means to
produce polyketide compounds. In this regard, reference may be made to PCT publication
Nos. WO 93/13663; WO 95/08548; WO 96/40968; WO 97/02358; and WO 98/27203;
U.S. Pat. Nos. 4,874,748; 5,063,155; 5,098,837; 5,149,639; 5,672,491; and 5,712,146; Fu
et al. (1994, Biochemistry 33: 9321-9326); McDaniel et al. (1993, Science 262: 1546-1550);
and Rohr (1995, Angew. Chem. Int. Ed. Engl. 34(8): 881-888).

Polyketides are synthesised in nature by polyketide synthases (PKS). These
enzymes, which are actually complexes of multiple enzyme activities, are in some ways
similar to, but in other ways different from, the synthases that catalyse condensation of
2-carbon units in the biosynthesis of fatty acids. Specifically, PKS enzymes catalyse the
biosynthesis of polyketides through repeated (decarboxylative) Claisen condensations
between acylthioesters (e.g., acetyl, propionyl, malonyl or methylmalonyl). Following each
condensation, they introduce structural variability into the product by catalysing all, part,
or none of a reductive cycle comprising a ketoreduction, dehydration, and enoylreduction
on the β-keto group of the growing polyketide chain. PKS enzymes incorporate enormous
structural diversity into their products, in addition to varying the condensation cycle, by
controlling choice of primer, extender units, and the overall chain length and, particularly
in the case of aromatic polyketides, regiospecific cyclisation of the nascent polyketide
chain. After the carbon chain has grown to a length characteristic of each specific product,
it is released from the synthase by thiolysis or acyltransfer. Thus, the PKS complexes
consist of families of enzymes which work together to produce a given polyketide. It is the
choice of chain-building units, controlled variation in chain length, and the reductive cycle,
genetically programmed into each PKS, that contributes to the variation seen among
naturally occurring polyketides.
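
The "all, part, or none" logic of the reductive cycle can be illustrated with a minimal
Python sketch. It assumes the conventional order of the optional steps (ketoreduction,
then dehydration, then enoylreduction); the function name and the "KR", "DH" and "ER"
labels are hypothetical and serve only to illustrate the logic, not to model any
particular PKS.

```python
def beta_carbon_state(domains):
    """Return the state left at the beta-carbon after one condensation.

    `domains` holds the reductive activities assumed to be present in a
    module (any of "KR", "DH", "ER"). Each step acts only on the product
    of the previous one, so a later activity without its predecessor has
    no effect in this simplified model.
    """
    active = set(domains)
    state = "beta-keto"                    # product of the condensation itself
    if "KR" in active:
        state = "beta-hydroxy"             # ketoreduction
        if "DH" in active:
            state = "alpha,beta-enoyl"     # dehydration
            if "ER" in active:
                state = "saturated"        # enoylreduction
    return state


if __name__ == "__main__":
    for combo in ([], ["KR"], ["KR", "DH"], ["KR", "DH", "ER"]):
        print(combo, "->", beta_carbon_state(combo))
```

Each combination yields a different functional group at the same chain position, which
is one source of the structural variability described above.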

Two major types of PKS enzymes are known; these differ in their composition
and mode of synthesis of the polyketide synthesised. These two major types of PKS
enzymes are commonly referred to as Type I or "modular" and Type II or "iterative" PKS
enzymes. These classifications are well known and reference may be made, for example, to
Hopwood and Khosla (1992).

The Type I or modular PKS enzymes typically catalyse the biosynthesis of
complex polyketides such as erythromycin and avermectin. These modular enzymes
include assemblies of several large multifunctional proteins carrying, between them, a set
of separate active sites for each step of carbon chain assembly and modification (Cortes et
al., 1990; Donadio et al., 1991; MacNeil et al., 1992). Accordingly, modular PKS
complexes can be viewed as biochemical assembly lines, composed of a series of catalytic
domains involved in sequential assembly and modification of acyl groups on the growing
polyketide chain (Cane et al., 1998; Keating and Walsh, 1999). The catalytic domains are
arranged in "modules", punctuated by acyl carrier protein (ACP) domains that tether the
nascent polyketide while it undergoes the catalytic modifications programmed in the
associated module. For each polyketide there is an initiation module, a series of elongation
modules that define the length and structure of the polyketide chain, and a termination
module to release the product from the final tether. The initiation module typically
comprises an acyl transferase (AT) domain that couples the initial acyl group from an
acyl-CoA substrate to the phosphopantetheinyl tether of the first ACP domain. Each elongation
module typically comprises a ketosynthase (KS), an AT and an ACP. The KS removes the
growing polyketide unit from the upstream ACP and couples it to the next acyl group in
the chain, which has already been selected and loaded by the AT onto the ACP in the same
module. Other catalytic domains (e.g., a ketoacyl reductase (KR) and a dehydratase (DH))
within an elongation module can modify the newly elongated polyketide before it is
transferred to the next module in the biochemical assembly line. A thioesterase (TE)
domain in the termination module accomplishes release of the assembled polyketide from
the last ACP in the series (Cane et al., 1998; Keating and Walsh, 1999).
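
The assembly-line picture can be sketched in a few lines of Python. This is a deliberately
simplified model under stated assumptions: the `Module` class, the module names and the
example domain sets are hypothetical, a condensation is counted whenever a module contains
a KS domain, and release is assumed to occur at the first module containing a TE domain.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Module:
    name: str
    domains: tuple  # domain labels in order, e.g. ("KS", "AT", "ACP")


def run_assembly_line(modules):
    """Walk the modules in order; return (condensations, released)."""
    condensations = 0
    released = False
    for module in modules:
        if released:
            raise ValueError(f"module {module.name!r} follows chain release")
        if "KS" in module.domains:
            condensations += 1   # KS couples the upstream chain to the next unit
        if "TE" in module.domains:
            released = True      # TE releases the chain from the last ACP
    return condensations, released


# Hypothetical four-module line: initiation, two elongations, termination.
EXAMPLE_LINE = (
    Module("initiation", ("AT", "ACP")),
    Module("elongation 1", ("KS", "AT", "KR", "ACP")),
    Module("elongation 2", ("KS", "AT", "KR", "DH", "ACP")),
    Module("termination", ("TE",)),
)

if __name__ == "__main__":
    print(run_assembly_line(EXAMPLE_LINE))
```

In this model the number of elongation modules fixes the number of condensations, which
mirrors the statement that the elongation modules define the length of the polyketide
chain.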
Biosynthesis of a polyketide can involve the sequential action of several PKS
proteins, each with one to six elongation modules (MacNeil et al., 1992; Apricio et al.,
1996). There are variations on the modular PKS design, including participation by some
loading domains across modules or in trans from separate proteins (Keating and Walsh,
1999), and several examples of hybrid PKS/NRPS proteins (Albertini et al., 1995; Gehring
et al., 1998; Duitman et al., 1999; Paitan et al., 1999). Subsequent modification of the
polyketide by dedicated tailoring enzymes is generally required to complete the
biologically active product (Hopwood, 1997). Other biologically active compounds
including antibiotics comprise polypeptides assembled by non-ribosomal peptide
synthetases (NRPSs). NRPSs typically show a modular architecture and tethered
biosynthetic strategy analogous to PKSs (Cane et al., 1998; Keating and Walsh, 1999). In
NRPSs a condensation (C) domain removes the growing peptide unit from the upstream
PCP domain and couples it to the next amino acid group in the chain, which has already
been selected and loaded by an adenylation (A) domain onto the PCP in the same module
(Marahiel et al., 1997; Stachelhaus et al., 1998). Other catalytic domains (e.g., epimerase
or N-methyltransferase) within an elongation module can modify the newly elongated
polypeptide before it is transferred to the next module in the biochemical assembly line
(Marahiel et al., 1997).

Many phytopathogenic bacteria and fungi secrete toxins with phytotoxic activity
and a broad spectrum of antimicrobial properties (Guenzi et al., 1998). Albicidin
phytotoxins are polyketides produced by Xanthomonas albilineans, which are key
pathogenicity factors in the development of leaf scald, one of the most devastating diseases of
sugarcane (Saccharum, interspecific hybrids) (Ricaud and Ryan, 1989; Zhang and Birch,
1997; Zhang et al., 1999). Albicidins selectively block prokaryote DNA replication and cause
the characteristic chlorotic symptoms of leaf scald disease by blocking chloroplast
development (Birch and Patil, 1983; 1985b; 1987a; 1987b). Because albicidins are rapidly
bactericidal at nanomolar concentrations against a broad range of Gram-positive and
Gram-negative bacteria, they are also of interest as potential clinical antibiotics (Birch and
Patil, 1985a).

The major antimicrobial component of the family of albicidins produced in
culture by X. albilineans has been partially characterised as a low Mr compound with
several aromatic rings (Birch and Patil, 1985a). Low yields have slowed studies into the
chemical structure of albicidin, its application as a tool to study prokaryote DNA
replication, and its development as a clinical antibiotic (Zhang et al., 1998). Genetic
analysis of albicidin biosynthesis is likely to indicate approaches to increase yields,
probable structural features, and opportunities for engineering novel antibiotics in this
family.
SUMMARY OF THE INVENTION

The present invention arises in part from the identification and characterisation of
several X. albilineans genes associated with albicidin biosynthesis. In particular, the
present inventor has isolated a novel X. albilineans gene (xabB), which encodes a large
protein (predicted Mr 525,695), with a modular architecture indicative of a multifunctional
PKS linked to a non-ribosomal peptide synthetase (NRPS). At 4801 amino acids in length,
the product of xabB (XabB) is the largest reported PKS-NRPS. Twelve catalytic domains
in this multifunctional enzyme are arranged in the order N-terminus-acyl-CoA ligase
(AL)-acyl carrier protein (ACP)-β-ketoacyl synthase (KS)-β-ketoacyl reductase
(KR)-ACP-ACP-KS-peptidyl carrier protein (PCP)-condensation domain (C)-adenylation
domain (A)-PCP-C. The modular architecture of XabB indicates likely steps in albicidin
biosynthesis, and approaches to enhance antibiotic yield. The novel pattern of domains, in
comparison with known PKS-NRPS enzymes for antibiotic production, also contributes to the
knowledge base for rational design of enzymes producing novel antibiotics. The present
inventor has found that XabB is required for the production of albicidins and that enhanced
expression of xabB leads to increased levels and/or functional activities of albicidin
antibiotics.
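
As a quick consistency check on the figures quoted above, the following Python sketch
tallies the domains in the stated order and derives the mean residue mass implied by the
predicted Mr and length. The constant and function names are hypothetical; only the domain
order, the Mr and the length are taken from the text.

```python
from collections import Counter

# Figures as quoted in the text above.
XABB_DOMAIN_ORDER = "AL-ACP-KS-KR-ACP-ACP-KS-PCP-C-A-PCP-C"
PREDICTED_MR = 525_695
LENGTH_AA = 4801


def summarise_domains(order):
    """Split a hyphen-separated domain string and count each domain type."""
    domains = order.split("-")
    return domains, Counter(domains)


domains, counts = summarise_domains(XABB_DOMAIN_ORDER)
mean_residue_mass = PREDICTED_MR / LENGTH_AA

if __name__ == "__main__":
    print("number of domains:", len(domains))
    print("domain counts:", dict(counts))
    print("mean residue mass (Da):", round(mean_residue_mass, 1))
```

The tally gives twelve domains, in agreement with the text, and the implied mean residue
mass of roughly 109.5 Da is in the range expected for a typical protein.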

A gene (xabC) encoding a novel O-methyltransferase has also been isolated,
which methylates albicidin precursors and/or intermediates involved in albicidin
biosynthesis. Surprisingly, enhanced expression of xabC has been found to increase the
levels and/or functional activities of albicidin antibiotics.

The present inventor has also isolated a gene (xabA) encoding a
phosphopantetheinyl transferase (PPTase), which is required for post-translational
activation of synthetases in the albicidin biosynthetic pathway. In this regard, it is known
that inefficient phosphopantetheinylation has limited the activity of other antibiotic
synthetases overexpressed in heterologous species (Walsh et al., 1997). Accordingly, the
isolated xabA gene, together with its target in the albicidin biosynthetic pathway
(xabB), provides the means to engineer high level co-expression of the albicidin synthetase
and its activating PPTase to obtain albicidins in higher yields, and ultimately to manipulate
the elements of the albicidin biosynthetic machinery, by mutagenesis or by other means, to
produce desired structural variants of this novel antibiotic class.
The above genes, in whole or in part, together with their variants and derivatives,
are useful inter alia for modulating the level and/or functional activity of albicidins, for 
expressing PKS enzymes in recombinant host cells, for producing polyketides including 
albicidins and their analogues and for combinatorial biosynthesis, as described hereinafter. 

Accordingly, one aspect of the present invention contemplates an isolated
polypeptide encoding at least a portion of an albicidin PKS-NRPS (XabB) or its variants or
derivatives. In one embodiment of this type, the invention provides an isolated polypeptide
comprising at least one domain selected from the group consisting of:

(a) an acyl-CoA ligase (AL) domain comprising a sequence set forth in any one or
more of SEQ ID NO: 6 and 8, or variants thereof;

(b) a β-ketoacyl synthase (KS) domain comprising a sequence set forth in any one or
more of SEQ ID NO: 10, 12, 14, 16, 18 and 20, or variants thereof;

(c) a β-ketoacyl reductase (KR) domain comprising the sequence set forth in SEQ ID
NO: 22, or variants thereof;

(d) an acyl carrier protein (ACP) domain comprising a sequence set forth in any one
or more of SEQ ID NO: 24, 26 and 28, or variants thereof;

(e) an adenylation (A) domain comprising a sequence set forth in any one or more of
SEQ ID NO: 30, 32, 34, 36, 38, 40, 42, 44, 46 and 48, or variants thereof;

(f) a peptidyl carrier protein (PCP) domain comprising a sequence set forth in any
one or more of SEQ ID NO: 50 and 52, or variants thereof; and

(g) a condensation (C) domain comprising a sequence set forth in any one or more of
SEQ ID NO: 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78 and 80, or variants
thereof.

Preferably, the AL domain comprises each of the sequences set forth in SEQ ID
NO: 6 and 8, or variants thereof.

In one embodiment, the KS domain preferably comprises each of the sequences 
set forth in SEQ ID NO: 10, 12 and 14, or variants thereof. In an alternate embodiment, the 
KS domain preferably comprises each of the sequences set forth in SEQ ID NO: 16, 18 and 
20, or variants thereof. 

Preferably, the A domain comprises each of the sequences set forth in SEQ ID
NO: 30, 32, 34, 36, 38, 40, 42, 44, 46 and 48, or variants thereof.
In one embodiment, the C domain preferably comprises each of the sequences set
forth in SEQ ID NO: 54, 56, 58, 60, 62, 64 and 66, or variants thereof. In an alternate 
embodiment, the C domain preferably comprises each of the sequences set forth in SEQ ID 
NO: 68, 70, 72, 74, 76, 78 and 80, or variants thereof. 

In another embodiment, the invention provides an isolated polypeptide comprising
at least a biologically active fragment or portion of the sequence set forth in SEQ ID NO:
2, or a variant or derivative thereof.

Suitably, the biologically active fragment is at least 6 amino acids in length. 

In a preferred embodiment, the domains broadly described above are arranged in
an N- to C-terminal direction as follows:
AL-ACP-KS-KR-ACP-ACP-KS-PCP-C-A-PCP-C.

Suitably, the biologically active fragment comprises at least one domain selected 
from the group consisting of the AL domain, the KS domain, the KR domain, the ACP 
domain, the A domain, the PCP domain and the C domain as broadly described above. 

Suitably, the variant has at least 60%, preferably at least 70%, more preferably at
least 80%, more preferably at least 90% and still more preferably at least 95% sequence
identity to the sequence set forth in SEQ ID NO: 2.
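
The tiered identity thresholds can be made concrete with a short Python sketch. It assumes
the two sequences have already been aligned to equal length with "-" marking gaps, and it
skips gapped columns when computing identity, which is only one of several conventions; the
function names and the toy sequences are hypothetical and are not taken from the sequence
listing.

```python
# Thresholds as recited above, highest first.
IDENTITY_TIERS = (95, 90, 80, 70, 60)


def percent_identity(aligned_a, aligned_b):
    """Percent identity over ungapped columns of two pre-aligned sequences."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for x, y in zip(aligned_a, aligned_b):
        if x == "-" or y == "-":
            continue             # gapped column: not counted (an assumption)
        compared += 1
        if x == y:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


def highest_tier_met(identity):
    """Return the highest recited threshold met, or None if below 60%."""
    for tier in IDENTITY_TIERS:
        if identity >= tier:
            return tier
    return None


if __name__ == "__main__":
    reference = "ACDEFGHIKL"     # toy sequences for illustration only
    candidate = "ACDEFGHIKV"
    identity = percent_identity(reference, candidate)
    print(identity, highest_tier_met(identity))
```

In practice the identity value depends on the alignment method and scoring parameters
used, so a sketch of this kind only illustrates how a computed value maps onto the recited
thresholds.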

Preferably, the variant comprises at least one sequence selected from the group
consisting of SEQ ID NO: 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40,
42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78 and 80, or variant
thereof. In this regard, the variant preferably has at least 70%, preferably at least 80%,
more preferably at least 90%, and still more preferably at least 95% sequence identity to
any one of the amino acid sequences set forth in SEQ ID NO: 6, 8, 10, 12, 14, 16, 18, 20,
22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68,
70, 72, 74, 76, 78 and 80.

In another aspect, the present invention contemplates an isolated polypeptide
encoding at least a portion of a PPTase (XabA) associated with albicidin biosynthesis or its
variants or derivatives. In one embodiment of this type, the invention provides an isolated
polypeptide comprising at least a biologically active fragment or portion of the sequence set
forth in SEQ ID NO: 83, or a variant or derivative thereof.

Suitably, the biologically active fragment comprises at least one, and preferably
both, of the consensus PPTase sequence motifs set forth in SEQ ID NO: 89 and 93, or
variant thereof. Preferably, the biologically active fragment comprises the intervening
sequence between the said consensus PPTase sequence motifs, which intervening sequence
comprises the sequence set forth in SEQ ID NO: 91, or variant thereof.

Preferably, the biologically active fragment comprises a contiguous sequence of
amino acids contained within the sequence set forth in SEQ ID NO: 87, or variant thereof.

Suitably, the variant has at least 60%, preferably at least 70%, more preferably at
least 80%, more preferably at least 90% and still more preferably at least 95% sequence
identity to the sequence set forth in SEQ ID NO: 83.

Preferably, the variant comprises at least one sequence selected from the group
consisting of SEQ ID NO: 87, 89, 91 and 93, or variant thereof. In this regard, the variant
preferably has at least 70%, preferably at least 80%, more preferably at least 90%, and still
more preferably at least 95% sequence identity to any one of the amino acid sequences set
forth in SEQ ID NO: 87, 89, 91 or 93.

In yet another aspect, the present invention contemplates an isolated polypeptide
encoding at least a portion of a methyltransferase (XabC) associated with albicidin
biosynthesis or its variants or derivatives. In one embodiment of this type, the invention
provides an isolated polypeptide comprising at least a biologically active fragment or portion
of the sequence set forth in SEQ ID NO: 95, or a variant or derivative thereof.

Suitably, the biologically active fragment comprises at least one, and preferably
all, of the consensus methyltransferase sequence motifs set forth in SEQ ID NO: 99, 101
and 103, or variant thereof.

Preferably, the biologically active fragment comprises a contiguous sequence of
amino acids contained within the sequence set forth in SEQ ID NO: 105, or variant thereof.
In a preferred embodiment, the biologically active fragment comprises a contiguous
sequence of amino acids contained within the sequence set forth in SEQ ID NO: 107, or
variant thereof.

Suitably, the variant has at least 60%, preferably at least 70%, more preferably at
least 80%, more preferably at least 90% and still more preferably at least 95% sequence
identity to the sequence set forth in SEQ ID NO: 95.

Preferably, the variant has at least 70%, preferably at least 80%, more preferably 
at least 90%, and still more preferably at least 95% sequence identity to any one of the 
amino acid sequences set forth in SEQ ID NO: 99, 101 and 103. 

In still yet another aspect, the invention contemplates an isolated polynucleotide
encoding at least a portion of an albicidin PKS-NRPS (XabB) or its variants or derivatives,
as broadly described above. Preferably, the polynucleotide comprises the sequence set
forth in any one of SEQ ID NO: 1 and 3, or a biologically active fragment thereof, or a
polynucleotide variant of these.

Suitably, the biologically active fragment is at least 18 nucleotides in length. 

The polynucleotide preferably encodes at least one domain selected from the
group consisting of the AL domain, the KS domain, the KR domain, the ACP domain, the
A domain, the PCP domain and the C domain as broadly described above.

Suitably, the AL domain is encoded by a nucleotide sequence set forth in any one
or more of SEQ ID NO: 5 and 7, or variants thereof. Preferably, the AL domain is encoded
by a nucleotide sequence comprising each of the sequences set forth in SEQ ID NO: 5 and
7, or variants thereof.

The KS domain is preferably encoded by a nucleotide sequence set forth in any
one or more of SEQ ID NO: 9, 11, 13, 15, 17 and 19, or variants thereof. In one
embodiment, the KS domain is preferably encoded by a nucleotide sequence comprising
each of the sequences set forth in SEQ ID NO: 9, 11 and 13, or variants thereof. In an
alternate embodiment, the KS domain is preferably encoded by a nucleotide sequence
comprising each of the sequences set forth in SEQ ID NO: 15, 17 and 19, or variants
thereof.
Preferably, the KR domain is encoded by a nucleotide sequence set forth in SEQ
ID NO: 21, or variant thereof. 

Suitably, the ACP domain is encoded by a nucleotide sequence set forth in any 
one or more of SEQ ID NO: 23, 25 and 27, or variants thereof. 

The A domain is preferably encoded by a nucleotide sequence set forth in any one
or more of SEQ ID NO: 29, 31, 33, 35, 37, 39, 41, 43, 45 and 47, or variants thereof. In a
preferred embodiment, the A domain is encoded by a nucleotide sequence comprising each
of the sequences set forth in SEQ ID NO: 29, 31, 33, 35, 37, 39, 41, 43, 45 and 47, or
variants thereof.

Suitably, the PCP domain is encoded by a nucleotide sequence set forth in any
one or more of SEQ ID NO: 49 and 51, or variants thereof.

Preferably, the C domain is encoded by a nucleotide sequence set forth in any one
or more of SEQ ID NO: 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77 and 79, or variants
thereof. In one embodiment, the C domain is preferably encoded by a nucleotide sequence
comprising each of the sequences set forth in SEQ ID NO: 53, 55, 57, 59, 61, 63 and 65, or
variants thereof. In an alternate embodiment, the C domain is preferably encoded by a
nucleotide sequence comprising each of the sequences set forth in SEQ ID NO: 67, 69, 71,
73, 75, 77 and 79, or variants thereof.

In one embodiment, the polynucleotide variant has at least 60%, preferably at
least 70%, more preferably at least 80%, and still more preferably at least 90% sequence
identity to any one of the polynucleotides set forth in SEQ ID NO: 1 or 3.

In another embodiment, the polynucleotide variant is capable of hybridising to
any one of the polynucleotides identified by SEQ ID NO: 1 or 3 under at least low
stringency conditions, preferably under at least medium stringency conditions, and more
preferably under high stringency conditions.

Preferably, the polynucleotide variant comprises a nucleotide sequence encoding 
at least one domain selected from the group consisting of the AL domain, the KS domain, 
the KR domain, the ACP domain, the A domain, the PCP domain and the C domain as 
broadly described above. 
In one embodiment, the nucleotide sequence variant has at least 60%, preferably
at least 70%, more preferably at least 80%, and still more preferably at least 90% sequence
identity to any one of the sequences set forth in SEQ ID NO: 5, 7, 9, 11, 13, 15, 17, 19, 21,
23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 67, 69,
71, 73, 75, 77 and 79.

In another embodiment, the nucleotide sequence variant is capable of hybridising
to any one of the sequences identified by SEQ ID NO: 5, 7, 9, 11, 13, 15, 17, 19, 21, 23,
25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71,
73, 75, 77 and 79 under at least low stringency conditions, preferably under at least
medium stringency conditions, and more preferably under high stringency conditions.

In a further aspect, the invention contemplates an isolated polynucleotide
encoding at least a portion of a PPTase (XabA) associated with albicidin biosynthesis or its
variants or derivatives, an isolated polynucleotide encoding a polypeptide, fragment,
variant or derivative as broadly described above. Preferably, the polynucleotide comprises
the sequence set forth in any one of SEQ ID NO: 82 and 84, or a biologically active
fragment thereof, or a polynucleotide variant of these.

Alternatively, the polynucleotide comprises a contiguous sequence of nucleotides 
contained within the sequence set forth in SEQ ID NO: 86, or variant thereof. 

In one embodiment, the polynucleotide variant has at least 60%, preferably at
least 70%, more preferably at least 80%, and still more preferably at least 90% sequence
identity to any one of the polynucleotides set forth in SEQ ID NO: 82, 84 and 86.

In another embodiment, the polynucleotide variant is capable of hybridising to
any one of the polynucleotides identified by SEQ ID NO: 82, 84 and 86 under at least low
stringency conditions, preferably under at least medium stringency conditions, and more
preferably under high stringency conditions.

Preferably, the polynucleotide variant comprises a nucleotide sequence encoding
at least one PPTase sequence motif selected from SEQ ID NO: 89 and 93, or variant
thereof. Suitably, the polynucleotide variant comprises a nucleotide sequence encoding the
intervening sequence between the said consensus PPTase sequence motifs, said nucleotide
sequence comprising the sequence set forth in SEQ ID NO: 91.

The polynucleotide variant suitably comprises a nucleotide sequence encoding a
contiguous sequence of amino acids contained within the sequence set forth in SEQ ID
NO: 87, or variant thereof. In this instance, the contiguous sequence is preferably encoded
by the sequence set forth in SEQ ID NO: 86, or nucleotide sequence variant thereof.

Suitably, the PPTase sequence motif is encoded by a nucleotide sequence 
comprising the sequence set forth in any one of SEQ ID NO: 88 and 92, or nucleotide 
sequence variant thereof. 

Preferably, the said intervening sequence is encoded by the nucleotide sequence
set forth in SEQ ID NO: 90, or nucleotide sequence variant thereof.

In one embodiment, the nucleotide sequence variant has at least 60%, preferably 
at least 70%, more preferably at least 80%, and still more preferably at least 90% sequence 
identity to any one of the sequences set forth in SEQ ID NO: 86, 88, 90 and 92. 

In another embodiment, the nucleotide sequence variant is capable of hybridising
to any one of the sequences identified by SEQ ID NO: 86, 88, 90 and 92 under at least low
stringency conditions, preferably under at least medium stringency conditions, and more
preferably under high stringency conditions.

In yet a further aspect, the invention contemplates an isolated polynucleotide
encoding at least a portion of a methyltransferase (XabC) associated with albicidin
biosynthesis or its variants or derivatives. Preferably, the polynucleotide comprises the
sequence set forth in any one of SEQ ID NO: 94 and 96, or a biologically active fragment
thereof, or a polynucleotide variant of these.

Alternatively, the polynucleotide comprises a contiguous sequence of nucleotides
contained within the sequence set forth in SEQ ID NO: 104, or variant thereof. In one
embodiment, this polynucleotide preferably comprises a contiguous sequence of
nucleotides contained within the sequence set forth in SEQ ID NO: 106, or variant thereof.
In one embodiment, the polynucleotide variant has at least 60%, preferably at
least 70%, more preferably at least 80%, and still more preferably at least 90% sequence 
identity to any one of the polynucleotides set forth in SEQ ID NO: 94, 96, 104 and 106. 

In another embodiment, the polynucleotide variant is capable of hybridising to
any one of the polynucleotides identified by SEQ ID NO: 94, 96, 104 and 106 under at
least low stringency conditions, preferably under at least medium stringency conditions,
and more preferably under high stringency conditions.

Preferably, the polynucleotide variant comprises a nucleotide sequence encoding a
methyltransferase sequence motif selected from any one or more of SEQ ID NO: 99, 101
and 103, or variant thereof.

Suitably, the methyltransferase sequence motif is encoded by a nucleotide 
sequence comprising the sequence set forth in any one of SEQ ID NO: 98, 100 and 102, or 
nucleotide sequence variant thereof. 

In one embodiment, the nucleotide sequence variant has at least 60%, preferably
at least 70%, more preferably at least 80%, and still more preferably at least 90% sequence
identity to any one of the sequences set forth in SEQ ID NO: 98, 100 and 102.

In another embodiment, the nucleotide sequence variant is capable of hybridising
to any one of the sequences identified by SEQ ID NO: 98, 100 and 102 under at least low
stringency conditions, preferably under at least medium stringency conditions, and more
preferably under high stringency conditions.

In still a further aspect, the invention features an expression vector comprising a 
polynucleotide as broadly described above wherein the polynucleotide is operably linked 
to a regulatory polynucleotide. 

In another aspect, the invention provides a host cell containing a said expression
vector.

Suitably, the host cell is a bacterium or other prokaryote. 

In yet another aspect, the invention is directed to a multiplicity of cell colonies,
constituting a library of colonies, wherein each colony of the library contains an expression
vector for the production of a polypeptide, fragment, variant or derivative as broadly
described above.

The invention also features a method of producing a recombinant polypeptide,
fragment, variant or derivative as broadly described above, comprising:

- culturing a host cell containing an expression vector as broadly described
above such that said recombinant polypeptide, fragment, variant or derivative is
expressed from said polynucleotide; and

- isolating the said recombinant polypeptide, fragment, variant or derivative.

In another aspect, the invention provides a method of producing a biologically
active fragment of a polypeptide as broadly described above, comprising:

- detecting an activity associated with a fragment of the polypeptide set forth in
SEQ ID NO: 2, wherein said activity is selected from the group consisting of acyl-CoA
ligase activity, β-ketoacyl synthase activity, β-ketoacyl reductase activity, acyl carrier
protein activity, adenylation activity, peptidyl carrier protein activity and condensation
activity; or

- detecting PPTase activity associated with a fragment of the polypeptide set
forth in SEQ ID NO: 83; or

- detecting methyltransferase activity associated with a fragment of the
polypeptide set forth in SEQ ID NO: 95;

wherein detection of said activity is indicative of said fragment being a biologically
active fragment.

In a further aspect, the invention provides a method of producing a biologically
active fragment as broadly described above, comprising:

- introducing a polynucleotide from which a fragment of a polypeptide as
broadly described above can be produced into a cell; and

- detecting an activity selected from the group consisting of acyl-CoA ligase
activity, β-ketoacyl synthase activity, β-ketoacyl reductase, acyl carrier protein activity,
adenylation activity, peptidyl carrier protein activity and condensation activity; or

- detecting PPTase activity associated with a fragment of the polypeptide set
forth in SEQ ID NO: 83; or

- detecting methyltransferase activity associated with a fragment of the
polypeptide set forth in SEQ ID NO: 95;

wherein detection of said activity is indicative of said fragment being a biologically
active fragment.

In yet a further aspect, the invention provides a method of producing a variant of a
polypeptide as broadly described above (parent polypeptide), or a biologically active
fragment thereof, comprising:

- producing a modified polypeptide whose sequence is distinguished from the
parent polypeptide or the biologically active fragment by substitution, deletion or
addition of at least one amino acid; and

- detecting an activity associated with the modified polypeptide, wherein said
activity is selected from the group consisting of acyl-CoA ligase activity, β-ketoacyl
synthase activity, β-ketoacyl reductase, acyl carrier protein activity, adenylation
activity, peptidyl carrier protein activity, condensation activity, PPTase activity and
methyltransferase activity, wherein detection of said activity is indicative of said
modified polypeptide being a variant.

In a further aspect, the invention contemplates a method of producing a variant of
a parent polypeptide as broadly described above, or biologically active fragment thereof,
comprising:

- producing a polynucleotide from which a modified polypeptide as described
above can be produced;

- introducing said polynucleotide into a cell; and

- detecting an activity associated with the modified polypeptide, wherein said
activity is selected from the group consisting of acyl-CoA ligase activity, β-ketoacyl
synthase activity, β-ketoacyl reductase, acyl carrier protein activity, adenylation
activity, peptidyl carrier protein activity, condensation activity, PPTase activity and
methyltransferase activity, wherein detection of said activity is indicative of said
modified polypeptide being a variant.

In yet another aspect, the invention extends to a method of screening for an agent
that modulates the expression of a gene or variant thereof or the level and/or functional
activity of an expression product of said gene or variant thereof, wherein said gene is
selected from xabB, xabA, or xabC, or a gene belonging to the same regulatory or
biosynthetic pathway as xabB, xabA, or xabC, said method comprising:

- contacting a preparation comprising a polypeptide encoded by said gene, or
biologically active fragment of said polypeptide, or variant or derivative of these, or a
genetic sequence (e.g., a transcriptional control element) that modulates the expression
of said gene or variant thereof, with a test agent; and

- detecting a change in the level and/or functional activity of said polypeptide or
biologically active fragment thereof, or variant or derivative, or of a product expressed
from said genetic sequence.

The transcriptional control element preferably comprises the sequence set forth in
SEQ ID NO: 81 or complement thereof.

The invention, in another aspect, also provides a method for enhancing the level
and/or functional activity of an albicidin, said method comprising:

- introducing into an albicidin-producing host cell (1) an agent that modulates
the expression of a gene encoding at least a portion of an albicidin PKS-NRPS or
variant or derivative thereof, or the level and/or functional activity of an expression
product of said gene, or (2) a vector from which a polynucleotide encoding at least a
portion of an albicidin PKS-NRPS or variant or derivative thereof can be translated; and

- culturing the host cell for a time and under conditions sufficient to
enhance the level and/or functional activity of said albicidin.

Preferably, the method further comprises introducing into said host cell a vector 
from which a PPTase can be translated. Suitably, the PPTase is selected from EntD or 
XabA. 

Preferably, the method further comprises introducing into said host cell a vector
from which a methyltransferase, more preferably an O-methyltransferase, and even more
preferably an S-adenosylmethionine O-methyltransferase can be translated.

According to another aspect of the invention, there is provided a method for
enhancing the level and/or functional activity of an albicidin, said method comprising
contacting a precursor of said albicidin or an intermediate involved in the biosynthesis of
said albicidin with at least a portion of an albicidin PKS-NRPS, or variant or derivative
thereof, as broadly described above, for a time and under conditions sufficient to enhance
the level and/or functional activity of said albicidin.

WO 02/24736 PCT/AU01/01190

Preferably, the method further comprises contacting a precursor of said albicidin 
or an intermediate involved in the biosynthesis of said albicidin with a PPTase. 

Preferably, the method further comprises contacting a precursor of said albicidin
or an intermediate involved in the biosynthesis of said albicidin with a methyltransferase,
more preferably an O-methyltransferase, and even more preferably an
S-adenosylmethionine O-methyltransferase.

In another aspect, the invention provides a method of identifying a PPTase for
enhancing the level and/or functional activity of an albicidin, said method comprising
introducing into an albicidin-deficient strain of X. albilineans which lacks xabA a vector
comprising a polynucleotide encoding a test PPTase, wherein said polynucleotide is
operably linked to a regulatory polynucleotide, and detecting production of albicidin.

Suitably, the strain is LS156 described herein.

Preferably, the PPTase is EntD.

The invention, in another aspect, also provides a method for enhancing the level
and/or functional activity of an albicidin, said method comprising:

- introducing into an albicidin-producing host cell (1) an agent that modulates
the expression of a gene encoding at least a portion of a PPTase associated with
albicidin biosynthesis or variant or derivative thereof, or the level and/or functional
activity of an expression product of said gene, or (2) a vector from which a
polynucleotide encoding at least a portion of a PPTase associated with albicidin
biosynthesis or variant or derivative thereof can be translated; and

- culturing the host cell for a time and under conditions sufficient to
enhance the level and/or functional activity of said albicidin.

In yet another aspect, the invention provides a method for enhancing the level
and/or functional activity of an albicidin, said method comprising:

- introducing into an albicidin-producing host cell (1) an agent that modulates
the expression of a gene encoding at least a portion of a methyltransferase associated
with albicidin biosynthesis or variant or derivative thereof, or the level and/or
functional activity of an expression product of said gene, or (2) a vector from which
a polynucleotide encoding at least a portion of a methyltransferase associated with
albicidin biosynthesis or variant or derivative thereof can be translated; and

- culturing the host cell for a time and under conditions sufficient to
enhance the level and/or functional activity of said albicidin.

In another aspect, the invention resides in an antigen-binding molecule that is 
immuno-interactive with a polypeptide, fragment, variant or derivative as broadly 
described above. 

In yet another aspect, the invention provides a method to prepare a polynucleotide
encoding a modified PKS, comprising using an albicidin PKS-NRPS encoding nucleotide
sequence as a scaffold and modifying the portions of the nucleotide sequence that encode
enzymatic activities, either by mutagenesis, inactivation, deletion, insertion, or
replacement.

In still yet another aspect, the invention contemplates a method for producing
polyketides, comprising expressing the modified albicidin PKS encoding nucleotide
sequence as broadly described in a suitable host cell to thereby produce a polyketide
different from that produced by the albicidin PKS-NRPS.

Another aspect of the invention contemplates the insertion of portions of the
albicidin PKS-NRPS coding sequence into other PKS coding sequences to modify the
products thereof.

In a further aspect, the invention encompasses use of the polypeptide, fragment,
variant or derivative as broadly described above, or the polynucleotide or vector as broadly
described above, or the modulatory agent as broadly described above for producing
secondary metabolites, preferably albicidins.

BRIEF DESCRIPTION OF THE DRAWINGS

Figure 1 is a schematic representation showing a physical and functional map of
part of the albicidin biosynthetic gene cluster. (A). Partial physical map of the Tn5
insertion locus in LS157 genomic DNA. Restriction enzymes used: C, ClaI; E, EcoRI; S,
SpeI; N, NotI; and B, BamHI. (B). Probes used to recover clone pXABB: Probe 1, 1.4-kb
EcoRI-NotI fragment digested from pBC157; and probe 2, 0.9-kb PCR product amplified
from Xal3 genomic DNA using primers complementary to sequences flanking the Tn5
insertion in LS157. (C). Clones and subclones used for sequencing, and described in Table
1. (D). The transcription directions of three putative ORFs in the 16.5-kb EcoRI fragment are
indicated by arrows. (E). Organisation of X. albilineans XabB constructed by comparison
with known protein sequences. The unshaded box indicates the PKS region, and the shaded box
indicates the NRPS region. Relative positions of potential catalytic domains or active sites are
indicated by: AL, acyl-CoA ligase; ACP, acyl carrier protein; KS, β-ketoacyl synthase;
KR, β-ketoacyl reductase; PCP, peptidyl carrier protein; C, condensation; A, adenylation.
Horizontal bars indicate proposed biosynthetic modules.

Figure 2 is a diagrammatic representation presenting the sequence of the region
upstream from xabB. The nucleotide sequence is numbered according to the 16511-bp
sequence in GenBank accession no. AF239749. The putative -35 and -10 promoter
sequences of xabB and the divergent gene xatA are underlined, as are ribosome-binding
sequences. The transcriptional directions of xabB and xatA are indicated by arrows.
Translational start codons are indicated by boldface type. Primers P1F1 and P1R are
shaded.

Figure 3 is a diagrammatic representation showing the alignment of X. albilineans
XabB enzymatic domains with those of PKSs and FASs from other organisms. Identical
amino acids are indicated by boldface type. Stars and overlines identify conserved amino
acids at catalytic sites. Xal-XabB, X. albilineans XabB for biosynthesis of albicidin (this
study); Hin-LCFA, Haemophilus influenzae long-chain fatty acid-CoA ligase (P46450);
Bsu-PksJ, B. subtilis polyketide synthase J (P40806); Bsu-MycA, B. subtilis MycA for
biosynthesis of mycosubtilin (AF184956); Pcr-ComL2, Petroselinum crispum
4-coumarate-CoA ligase 2 (P14913); Sma-FkbB, S. sp. MA6548 FkbB for biosynthesis of
FK506 (AF082099); Ame-RifA, Amycolatopsis mediterranei RifA for biosynthesis of
rifamycin B (AF040570); Shy-RapA, S. hygroscopicus RapA for biosynthesis of
rapamycin (X86780); Mxa-Ta1, M. xanthus Ta1 for biosynthesis of TA (AJ006977);
Ser-EryA1 and EryA3, S. erythraea EryA modules for biosynthesis of erythromycin (M63676,
M63677); Che-PKS1, Cochliobolus heterostrophus PKS1 for biosynthesis of T-toxin
(U68040); Bsu-PksM, B. subtilis PKS for a polyketide synthase (O31781); Mtu-PpsA, M.
tuberculosis PKS for a polyketide synthase (G3261605); Mtu-MAS, M. tuberculosis MAS
for biosynthesis of mycocerosic acid (M95808); Chick-FAS, chicken fatty acid synthase
(M22987); Rat-FAS, rat fatty acid synthase (X14175).

Figure 4 is a graphical representation showing albicidin production by wild-type
X. albilineans LS155 (▲), complemented Tox⁻ mutant strain LS157 pLXABB1 (O),
complemented Tox⁻ mutant strain LS157 pLXABB2 (•), LS157 (■), and LS157
pLAFR3 (+). Albicidin concentrations in culture supernatants were quantified based on
inhibition zone width in a microbial bioassay (means +/- standard errors from 5 replicates).
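
For illustration, the summary statistic quoted in this legend (mean +/- standard error from replicate measurements) could be computed as sketched below. The sketch assumes the standard error is the sample standard deviation divided by the square root of the number of replicates. The function name mean_and_standard_error is hypothetical, and no replicate data from the figure are reproduced here.

```python
# Illustrative sketch only: mean and standard error of replicate measurements.
# Assumption: standard error = sample standard deviation / sqrt(n).
import math
import statistics


def mean_and_standard_error(values):
    """Return (mean, standard error) for two or more replicate values."""
    if len(values) < 2:
        raise ValueError("at least two replicates are required")
    mean = statistics.mean(values)
    standard_error = statistics.stdev(values) / math.sqrt(len(values))
    return mean, standard_error
```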

Figure 5 is a graphical representation showing the relationship between growth
(■), albicidin production (O), and GUS activity (A) in X. albilineans LS155 pRG960p1
(A) and in LS155 pRG960p2 (B). Relative activity (means +/- standard errors from 2
replicates): 100% growth, OD550 = 1.43; 100% albicidin production = 268.5 units/ml;
100% GUS activity = 119 units/mg of protein (one unit equals 1 pmol of
methylumbelliferone formed per min.). Locations and sizes of inserts on pRG960p1 and
pRG960p2 are indicated in Figure 2 and Table 1. GUS, β-glucuronidase.
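
As a worked illustration of the relative scale used in this legend, an absolute measurement can be expressed as a percentage of the stated 100% reference value. The three reference values below are taken from the legend; the dictionary keys and the function name relative_activity are hypothetical labels chosen for this sketch.

```python
# Illustrative sketch only: express a measurement relative to the 100% values
# stated in the Figure 5 legend.
REFERENCE_100_PERCENT = {
    "growth_od550": 1.43,             # 100% growth, OD550
    "albicidin_units_per_ml": 268.5,  # 100% albicidin production
    "gus_units_per_mg": 119.0,        # 100% GUS activity, units/mg of protein
}


def relative_activity(kind: str, value: float) -> float:
    """Return value as a percentage of the reference for the given measurement."""
    return 100.0 * value / REFERENCE_100_PERCENT[kind]
```

On this scale a GUS activity of 59.5 units/mg of protein corresponds to approximately 50%.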

Figure 6 is a schematic representation showing the organisation of five known
PKS-NRPS enzymes. X. albilineans XabB, encoded by xabB for albicidin biosynthesis
(this study); B. subtilis MycA for mycosubtilin biosynthesis (Duitman et al., 1999);
Yersinia pestis HMWP1 for yersiniabactin biosynthesis (Gehring et al., 1998); M. xanthus
partial gene product Ta1 for TA biosynthesis (Paitan et al., 1999); B. subtilis PksorfX6 for
unknown function (Albertini et al., 1995). Unshaded boxes indicate PKS regions, grey
boxes indicate NRPS regions, and dark boxes indicate amino transferase (AMT) or
methyltransferase (MT). Vertical bars follow the carrier domains at the end of each
biosynthetic "module".

Figure 7 is a diagrammatic representation showing a dendrogram (GCG) analysis
of adenylation domains of XabB and its homologous peptide synthetases. Peptide
synthetases, including various modules of the same multienzyme complex, are as follows:
GrsA and GrsB, gramicidin synthetase A and B, respectively, from B. subtilis (X15577,
X61658); BacA, BacB, and BacC, bacitracin synthetase A, B, and C, respectively, from B.
licheniformis (AF007865); SnbC and SnbDE, pristinamycin I synthetase C and DE,
respectively, from S. pristinaespiralis (X98690, Y11547); FkbP, FK506 synthetase FkbP
from S. sp. MA6548 (AF082100); TycA, TycB, and TycC, tyrocidine synthetase A, B, and
C, respectively, from B. brevis (AF004835); SyrE, syringomycin synthetase E1 from
Pseudomonas syringae pv. syringae (AF047828); EntF, enterobactin synthetase F from E.
coli (P11454); DhbF, 2,3-dihydroxybenzoate synthetase F from B. subtilis (P45745);
FenD, fengycin synthetase FenD1 from B. subtilis (AJ011849); SrfAA, SrfAB, and SrfAC,
surfactin A synthetase A, B, and C, respectively, from B. subtilis (X70356); XabB,
albicidin synthase B from X. albilineans (this study). The A4 to A5 regions (about 100 aa)
of adenylation domains of peptide synthetases, which are involved in amino acid recognition
and binding, were aligned using the PILEUP program with default parameters.

Figure 8 is a diagrammatic representation showing a restriction map of clones
including the xabA gene from X. albilineans. Sequencing by primer walking commenced at
the T3 and T7 primers. The location and direction of transcription of the xabA ORF is
shown by an arrow. Restriction enzymes are: E, EcoRI; P, PstI; C, ClaI; and H, HindIII.

Figure 9 is a diagrammatic representation presenting the sequence of the xabA
gene. The nucleotide sequence is numbered according to the 3-kb sequence in GenBank
accession no. AF191324. The closest matches to RBS region and promoter consensus
sequences are underlined, as are the region of dyad symmetry and putative
factor-independent termination sites. Translation start and stop codons are indicated by boldface
type. The (V/T)G(V/I)D and (F/W)(S/C/T)xKE(A/S)xxK motifs conserved in PPTase
enzymes are boxed. The insertion site of Tn5 is marked (T).
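
For illustration, the two PPTase motifs written in this legend can be translated into regular expressions and located in a protein sequence as sketched below. The sketch assumes the usual reading of the notation, in which a parenthesised, slash-separated group lists the residues allowed at one position and "x" stands for any residue. The function names and the example sequence used in the usage note are hypothetical and are not taken from the sequence listing.

```python
# Illustrative sketch only: convert the motif notation of the legend into a
# regular expression and report where each motif starts in a protein sequence.
# Assumption: "(A/B)" means A or B at one position, and "x" means any residue.
import re

PPTASE_MOTIFS = ("(V/T)G(V/I)D", "(F/W)(S/C/T)xKE(A/S)xxK")


def motif_to_regex(motif: str) -> str:
    """Translate e.g. '(V/T)G(V/I)D' into '[VT]G[VI]D'."""
    parts = []
    i = 0
    while i < len(motif):
        ch = motif[i]
        if ch == "(":
            close = motif.index(")", i)
            options = motif[i + 1:close].split("/")
            parts.append("[" + "".join(options) + "]")
            i = close + 1
        elif ch == "x":
            parts.append(".")  # any single residue
            i += 1
        else:
            parts.append(ch)   # a fixed residue
            i += 1
    return "".join(parts)


def find_motifs(protein: str) -> dict:
    """Return 0-based start positions of non-overlapping matches per motif."""
    hits = {}
    for motif in PPTASE_MOTIFS:
        pattern = re.compile(motif_to_regex(motif))
        hits[motif] = [m.start() for m in pattern.finditer(protein)]
    return hits
```

Applied to the made-up sequence "MATGVDLLWSAKEAAIK", this sketch reports the first motif at position 2 and the second at position 8 (0-based).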

Figure 10 is a graphical representation showing albicidin production by wild-type
X. albilineans strain Xal3 (O), Xal3 pLXABA (•), and complemented Tox⁻ mutant strain
LS156 pLXABA (A). Albicidin concentrations in culture supernatants were quantified
based on inhibition zone width in a microbial bioassay (means +/- standard errors from 2
replicates).

Figure 11 is a schematic representation showing a dendrogram (GCG) analysis of
PPTases involved in antibiotic and fatty acid biosynthesis in bacteria. Sau, Salmonella
austin; Sty, Salmonella typhimurium; Bbr, Bacillus brevis; Xal, Xanthomonas albilineans;
Eco, Escherichia coli; Sfl, Shigella flexneri; Bpu, Bacillus pumilus; Bsu, Bacillus subtilis;
Mtu, Mycobacterium tuberculosis; Hin, Haemophilus influenzae. The sources of amino
acid sequence of PPTases correspond to those in Table 2, and the sequences were aligned
using the PILEUP program with default parameters.

Figure 12 is a schematic representation showing the organisation of part of the
albicidin biosynthetic gene cluster. The location and direction of three ORFs are indicated
by thick arrows. Vertical lines indicate the position of restriction enzyme sites: E, EcoRI;
B, BamHI; S, SpeI; N, NcoI. The vertical lines with triangles (^) show the position of
insertional mutagenesis sites or Tn5 insertion site, and the resultant mutants are bracketed.
The arrows above the physical map indicate the locations of primers used to amplify
sequence downstream of the EcoRI restriction site by IPCR. The cloned regions for
complementation tests are shown below the map.

Figure 13 is a diagrammatic representation presenting the nucleotide and deduced
amino acid sequences of the xabC region. The nucleotide sequence is numbered according
to the 1515-bp sequence in GenBank accession no. AF239750. The potential RBS and
selected restriction sites are underlined. The putative factor-independent termination
signals are underlined and indicated by bold letters. Translation start and stop codons are
indicated by bold letters. The conserved motifs in Mtases are boxed. Primers used for PCR
(A3F and A3R) and IPCR (IR) are shaded.

Figure 14 is a diagrammatic representation showing the conserved sequence
motifs in Mtases involved in antibiotic biosynthesis in bacteria. Identical or similar amino
acids (A = G; D = E; I = L = V) are shown in bold. Numbers indicate amino acid residues
from the N terminus of the protein. Xal-XabC, putative albicidin biosynthesis Mtase from
X. albilineans (this study); Sgl-TcmO and Sgl-TcmN, multifunctional
cyclase-dehydrase-3-O-Mtase and tetracenomycin polyketide synthesis 8-O-Mtase of Streptomyces
glaucescens, respectively (accession number M80674); Smy-MdmC, midecamycin-O-Mtase
of S. mycarofaciens (M93958); Mxa-SafC, saframycin O-Mtase of Myxococcus
xanthus (U24657); Ser-EryG, erythromycin biosynthesis O-Mtase of Saccharopolyspora
erythraea (S18533); Spe-DauK, carminomycin 4-O-Mtase from S. peucetius (L13453);
Sal-DmpM, O-demethylpuromycin-O-Mtase from S. alboniger (M74560); Shy-RapM,
rapamycin O-Mtase of S. hygroscopicus (X86780); Sav-AveD, avermectin B 5-O-Mtase
from S. avermitilis (G5921167).
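
For illustration, the similarity rule stated in this legend (A = G; D = E; I = L = V) can be applied to a set of aligned motif sequences to flag the columns in which all residues are identical or similar. The three similarity groups are taken from the legend; the function names are hypothetical, and the treatment of any column containing a gap ("-") as not conserved is an assumption of this sketch.

```python
# Illustrative sketch only: mark alignment columns whose residues are all
# identical or similar under the groups stated in the legend.
SIMILARITY_GROUPS = ("AG", "DE", "ILV")  # A = G; D = E; I = L = V


def residue_class(residue: str) -> str:
    """Map a residue to a representative of its similarity group."""
    for group in SIMILARITY_GROUPS:
        if residue in group:
            return group[0]
    return residue  # residues outside the groups are similar only to themselves


def conserved_mask(aligned) -> str:
    """Return '*' for conserved columns and '.' otherwise."""
    if len({len(s) for s in aligned}) != 1:
        raise ValueError("sequences must be non-empty and of equal aligned length")
    mask = []
    for column in zip(*aligned):
        classes = {residue_class(r) for r in column}
        # Assumption: a column containing a gap is never reported as conserved.
        conserved = "-" not in column and len(classes) == 1
        mask.append("*" if conserved else ".")
    return "".join(mask)
```

For the made-up aligned fragments "VDLGAG", "ILEGGG" and "LDVGSG", the first, fourth and sixth columns are flagged, giving the mask "*..*.*".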

Figure 15 is a graphical representation showing albicidin production by wild-type
X. albilineans LS155 (o), Tox⁻ xabC insertion mutant LS-JP2 (□), complemented strain
LS-JP2 pLXABC containing Lac promoter - full length xabC gene (O), and complemented
strain LS-JP2 pLXABB1 containing full length xabB plus functional N-terminal region of
xabC (IT). Albicidin concentrations in culture supernatants were quantified based on
inhibition zone width in a microbial bioassay (means +/- standard errors from 2 or 3
replicates).

BRIEF DESCRIPTION OF THE SEQUENCES: SUMMARY TABLE



TABLE A 



SEQUENCE ID 
• NUMBER 


SEQUENCE 


LENGTH 


SEQIDNO: 1 


Full-length xabB (Accession No. AF239749) 


16551 bases 


SEQIDNO:2 


Full-length polypeptide sequence encoded by SEQ 
ID NO: 1 


4801 residues 


SEQIDNO: 3 


Full-length coding sequence o£xabB 


14406 bases 


SEQIDNO: 4 


Polypeptide sequence encoded by SEQ ED NO: 3 


4801 residues 


SEQ ID NO: 5 


Sub-sequence of SEQ ID NO: 1 and 3 encoding acyl- 
CoA ligase subdomain I 


45 bases 


SEQIDNO: 6 


Acyl-CoA ligase subdomain I encoded by SEQ ID 
NO: 5 


15 residues 


SEQ ID NO: 7 


Sub-sequence of SEQ ID NO: 1 and 3 encoding acyl- 
CoA ligase subdomain II 


24 bases 


SEQ ID NO: 8 


Acyl-CoA ligase subdomain I encoded by SEQ ID 
NO: 7 


8 residues 


SEQ ID NO: 9 


Sub-sequence of SEQ ID NO: 1 and 3 encoding 0- 
ketoacyl synthase 1 subdomain I 


51 bases 


SEQIDNO: 10 


j8-Ketoacyl synthase 1 subdomain I encoded by SEQ 
ID NO: 9 


17 residues 


SEQIDNO: 11 


Sub-sequence of SEQ ID NO: 1 ana 3 encoding p- 
ketoacyl synthase 1 subdomain II 


d\) oases 


SEQIDNO: 12 


j3-Ketoacyl synthase 1 subdomain II encoded by SEQ 
ID NO: 11 


10 residues 


SEQIDNO: 13 


Sub-sequence of SEQ ID NO: 1 and 3 encoding /?- 
ketoacyl synthase 1 subdomain HI 


30 bases 


SEQ ID NO: 14 


j3-Ketoacyl synthase 1 subdomain HI encoded by 
SEQ ID NO: 13 


10 residues 


SEQIDNO: 15 


Sub-sequence of SEQ ID NO: 1 and 3 encoding /?- 
ketoacyl synthase 2 subdomain I 


51 bases 
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SEQUENCE ID . 
NUMBER 


SEQUENCE 


LENGTH 


SEQIDNO: 16 


jS-Ketoacyl synthase 2 subdomain I encoded by SEQ 
ID NO: 15 


17 residues 


SEQIDNO: 17 


Sub-sequence of SEQ ID NO: 1 and 3 encoding j8- 
ketoacyl synthase 2 subdomain II 


30 bases 


SEQIDNO: 18 


/J-Ketoacyl synthase 2 subdomain II encoded by SEQ 
ID NO: 17 


10 residues 


SEQIDNO: 19 


Sub-sequence of SEQ ID NO: 1 and 3 encoding j8- 
ketoacyl synthase 2 subdomain III 


30 bases 


SEQIDNO: 20 


j3-Ketoacyl synthase 2 subdomain IQ encoded by 
SEQIDNO: 19 


10 residues 


SEQ ID NO: 21 


Sub-sequence of SEQ ID NO: 1 and 3 encoding j8- 
ketoacyl reductase domain 


93 bases 


SEQIDNO: 22 


/?-Ketoacyl reductase domain encoded by SEQ ID 
NO: 21 


31 residues 


SEQIDNO: 23 


Sub-sequence of SEQ ID NO: 1 and 3 encoding acyl 
carrier protein 1 domain 


36 bases 


SEQ ID NO: 24 


Acyl carrier protein 1 domain encoded by SEQ ID 
NO: 23 


12 residues 


SEQ ID NO: 25 


Sub-sequence of SEQ ID NO: 1 and 3 encoding acyl 
carrier protein 2 domain 


36 bases 


SEQIDNO: 26 


Acyl carrier protein 2 domain encoded by SEQ ID 
NO: 25 


12 residues 


SEQIDNO: 27 


Sub-sequence of SEQ ID NO: 1 and 3 encoding acyl 
carrier protein 3 domain 


36 bases 


SEQ ID NO: 28 


Acyl carrier protein 3 domain encoded by SEQ ID 
NO: 27 


12 residues 


SEQIDNO: 29 


Sub-sequence of SEQ ID NO: 1 and 3 encoding 
adenylation domain subdomain I 


18 bases 


SEQIDNO: 30 


Adenylation domain subdomain I encoded by SEQ 
ID NO: 29 


6 residues 


SEQ ID NO: 31 


Sub-sequence of SEQ ID NO: 1 and 3 encoding 
adenylation domain subdomain II 


33 bases 
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SEQUENCE ID 
NUMBER. 


SEQUENCE 


LENGTH 


SEQ ID NO: 32 


Adenylation domain subdomain II encoded by SEQ 
ED NO: 31 


11 residues 


SEQ ID NO: 33 


Sub-sequence of SEQ ID NO: 1 and 3 encoding 
adenylation domain subdomain HI 


48 bases 


SEQ ID NO: 34 


Adenylation domain subdomain EH encoded by SEQ 
ED NO: 33 


16 residues 


SEQ ID NO: 35 


Sub-sequence of SEQ ED NO: 1 and 3 encoding 
adenylation domain subdomain IV 


12 bases 


SEQ ID NO: 36 


Adenylation domain subdomain IV encoded by SEQ 
ID NO: 35 


4 residues 


SEQ ID NO: 37 


Sub-sequence of SEQ ID NO: 1 and 3 encoding 
adenylation domain subdomain V 


21 bases 


SEQ ID NO: 38 


Adenylation domain subdomain V encoded by SEQ 
ID NO: 37 


7 residues 


SEQ ID NO: 39 


Sub-sequence of SEQ ID NO: 1 and 3 encoding 
adenylation domain subdomain VI 


45 bases 


SEQ ID NO: 40 


Adenylation domain subdomain VI encoded by SEQ 
ED NO: 39 


15 residues 


SEQ ID NO: 41 


Sub-sequence of SEQ ED NO: 1 and 3 encoding 
adenylation domain subdomain VII 


18 bases 


SEQ ID NO: 42 


Adenylation domain subdomain VEt encoded by SEQ 
ID NO: 41 


6 residues 


SEQ ID NO: 43 


Sub-sequence of SEQ ED NO: 1 and 3 encoding 
adenylation domain subdomain VEQ 


60 bases 


SEQ ID NO: 44 


Adenylation domain subdomain VHI encoded by 
SEQ ID NO: 43 


20 residues 


SEQ ED NO: 45 


Sub-sequence of SEQ ID NO: 1 and 3 encoding 
adenylation domain subdomain DC 


21 bases 


SEQ ID NO: 46 


Adenylation domain subdomain EX encoded by SEQ 
ED NO: 45 


7 residues 


SEQ ED NO: 47 


Sub-sequence of SEQ ED NO: 1 and 3 encoding 
adenylation domain subdomain X 


18 bases 
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SEQUENCE ID 
NUMBER 


SEQUENCE 


LENGTH 


SEQ ID NO: 48 


Adenylation domain subdomain X encoded by SEQ 
ID NO: 47 


6 residues 


SEQIDNO: 49 


Sub-sequence of SEQ ID NO: 1 and 3 encoding 
peptidyl carrier protein 1 domain 


33 bases 


SEQ ID NO: 50 


Peptidyl carrier protein 1 domain encoded by SEQ 
ID NO: 49 


1 1 residues 


SEQIDNO: 51 


Sub-sequence of SEQ ID NO: 1 and 3 encoding 
peptidyl carrier protein 2 domain 


33 bases 


SEQIDNO: 52 


Peptidyl carrier protein 2 domain encoded by SEQ 
ID NO: 51 


1 1 residues 


SEQIDNO: 53 


Sub-sequence of SEQ ID NO: 1 and 3 encoding 
condensation domain 1 subdomain I 


30 bases 


SEQIDNO: 54 


Condensation domain 1 subdomain I encoded by 
SEQIDNO: 53 


10 residues 


SEQIDNO: 55 


Sub-sequence of SEQ ID NO: 1 and 3 encoding 
condensation domain 1 subdomain II 


27 bases 


SEQIDNO: 56 


Condensation domain 1 subdomain II encoded by 
SEQIDNO: 55 


9 residues 


SEQ ID NO: 57 


Sub-sequence of SEQ ID NO: 1 and 3 encoding 
condensation domain 1 subdomain HE 


30 bases 


SEQ ID NO: 58 


Condensation domain 1 subdomain m encoded by 
SEQIDNO: 57 


10 residues 


SEQIDNO: 59 


Sub-sequence of SEQ ID NO: 1 and 3 encoding 
condensation domain 1 subdomain IV 


21 bases 


SEQIDNO: 60 


Condensation domain 1 subdomain TV encoded by 
SEQIDNO: 59 


7 residues 


SEQ ID NO: 61 


Sub-sequence of SEQ ID NO: 1 and 3 encoding 
condensation domain 1 subdomain V 


36 bases 


SEQ ID NO: 62 


Condensation domain 1 subdomain V encoded by 
SEQIDNO: 61 


12 residues 


SEQ ID NO: 63 


Sub-sequence of SEQ ID NO: 1 and 3 encoding 
condensation domain 1 subdomain VI 


21 bases 
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SEQUENCE ID 
NUMBER 


SEQUENCE 


LENGTH 


SEQ ID NO: 64 


Condensation domain 1 subdomain VI encoded by 
SEQIDNO: 63 


7 residues 


SEQIDNO: 65 


Sub-sequence of SEQ ID NO: 1 and 3 encoding 
condensation domain 1 subdomain VII 


24 bases 


SEQ ID NO: 66 


Condensation domain 1 subdomain VII encoded by 
SEQ ID NO: 65 


8 residues 


SEQ ID NO: 67 


Sub-sequence of SEQ ID NO: 1 and 3 encoding 
condensation domain 2 subdomain I 


30 bases 


SEQ ID NO: 68 


Condensation domain 2 subdomain I encoded by 
SEQIDNO: 67 


10 residues 


SEQIDNO: 69 


Sub-sequence of SEQ ID NO: 1 and 3 encoding 
condensation domain 2 subdomain II 


27 bases 


SEQ ID NO: 70 


Condensation domain 2 subdomain II encoded by 
SEQ ID NO: 69 


9 residues 


SEQ ID NO: 71 


Sub-sequence of SEQ ID NO: 1 and 3 encoding 
condensation domain 2 subdomain HI 


30 bases 


SEQ ID NO: 72 


Condensation domain 2 subdomain III encoded by 
SEQ ID NO: 71 


10 residues 


SEQ ID NO: 73 


Sub-sequence of SEQ ID NO: 1 and 3 encoding 
condensation domain 2 subdomain IV 


21 bases 


SEQ ID NO: 74 


Condensation domain 2 subdomain IV encoded by 
SEQIDNO: 73 


7 residues 


SEQ ID NO: 75 


Sub-sequence of SEQ ID NO: 1 and 3 encoding 
condensation domain 2 subdomain V 


33 bases 


SEQIDNO: 76 


Condensation domain 2 subdomain V encoded by 
SEQ ID NO: 75 


1 1 residues 


SEQ ID NO: 77 


Sub-sequence of SEQ ID NO: 1 and 3 encoding 
condensation domain 2 subdomain VI 


21 bases 


SEQ ID NO: 78 


Condensation domain 2 subdomain VI encoded by 
SEQIDNO: 77 


7 residues 


SEQ ID NO: 79 


Sub-sequence of SEQ ID NO: 1 and 3 encoding 
condensation domain 2 subdomain VII 


24 bases 
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SEQUENCE ID 
NUMBER 


SEQUENCE. 


LENGTH 


<s"PO TTJ NO* 80 


Condensation domain 2 subdomain VII encoded by 
SEQIDNO: 79 


8 residues 


SEQIDNO: 81 


Polynucleotide comprising xabB promoter 


242 bases 


SEQ ID NO: 82 


Full-length *aM (Accession No. AF191324) 


1200 bases 




Fnll-lenolh nolvnentide seauence encoded bv SEO 
ID NO: 82 


278 residues 


SEQ ID NO: 84 


Full-length coding sequence of xabA 


837 bases 


SFO ID NO- 85 


Polvoentide seauence encoded bv SEO ID NO: 84 


278 residues 


SEQIDNO: 86 


Sub-sequence of SEQ ID NO: 82 encoding PPTase 
domain 

Vl \f XXX (4.1 IX 


168 bases 


SEQIDNO: 87 


PPTase domain encoded by SEQ ID NO: 86 


56 residues 


SEQ ID NO: 88 


Sub-sequence of SEQ ID NO: 82 encoding a motif 
(motif I) conserved in PPTases 


27 bases 


SEO ID NO- 89 


PPTase motif I amino acid sequence encoded by 
SEQIDNO: 88 


9 residues 


SEQ ID NO: 90 


Sub-sequence of SEQ ID NO: 82 encoding 
intervening amino acid sequence lmking motifs I and 

n 


117 bases 


SEQ ID NO: 91 


Intervening amino acid sequence encoded by SEQ ID 
NO- 90 


39 residues 


SEQ ID NO: 92 


Sub-sequence of SEQ ID NO: 82 encoding a motif 
fmotif rn conserved in PPTases 

IlilUlll -Li- J WJllot'X V WX jxi XXX aowa 


36 bases 


SEO ID NO- 93 


PPTase motif II amino acid seauence encoded bv 
SEQIDNO: 92 


12 residues 


SEQ ID NO: 94 


Full-length xabC (Accession No. AF239750) 


1515 bases 


SEQ ID NO: 95 


Full-length polypeptide sequence encoded by SEQ 
ID NO: 94 


343 residues 


SEQ ID NO: 96 


Full-length coding sequence of xabC 


1029 bases 


SEQIDNO: 97 


Polypeptide sequence encoded by SEQ ID NO: 96 


343 residues 
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SEQUENCE ID 
NU1VLBER 


SEQUENCE 


LENGTH 


SEQ ID NO: 98 


Sub-sequence of SEQ ID NO: 94 encoding a motif 
(motif I) conserved in methyltransferases 


21 bases 


SEQIDNO: 99 


Methyltransferase motif I amino acid sequence 
encoded by SEQ ID NO: 98 


7 residues 


SEQ ID NO: 100 


Sub-sequence of SEQ ID NO: 94 encoding a motif
(motif II) conserved in methyltransferases


24 bases


SEQ ID NO: 101


Methyltransferase motif II amino acid sequence
encoded by SEQ ID NO: 100


8 residues


SEQ ID NO: 102


Sub-sequence of SEQ ID NO: 94 encoding a motif
(motif III) conserved in methyltransferases


27 bases 


SEQ ID NO: 103 


Methyltransferase motif III amino acid sequence
encoded by SEQ ID NO: 102 


9 residues 


SEQ ID NO: 104 


Polynucleotide encoding said motifs I, II and III


303 bases 


SEQ ID NO: 105 


Polypeptide encoded by SEQ ID NO: 104 


101 residues 


SEQ ID NO: 106 


Biologically active fragment of SEQ ID NO: 94 


831 bases 


SEQ ID NO: 107


Biologically active fragment of SEQ ID NO: 95 


277 residues 
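
The nucleotide and residue lengths listed in the table can be cross-checked with simple codon arithmetic. The following is a minimal sketch only: the helper name `codon_surplus` and the dictionary labels are illustrative assumptions, and the length pairs are copied from the table rows above. A surplus of 0 means every codon corresponds to a residue; a surplus of 1 would be consistent with a terminal stop codon being counted in the nucleotide length.

```python
# Illustrative cross-check of the table lengths; names here are hypothetical.

def codon_surplus(bases, residues):
    """Return (number of whole codons) minus (number of residues)."""
    if bases % 3 != 0:
        raise ValueError("nucleotide length is not a whole number of codons")
    return bases // 3 - residues

# (bases, residues) pairs as listed in the table.
TABLE_PAIRS = {
    "SEQ ID NO: 84/85 (xabA coding sequence)": (837, 278),
    "SEQ ID NO: 86/87 (PPTase domain)": (168, 56),
    "SEQ ID NO: 88/89 (PPTase motif I)": (27, 9),
    "SEQ ID NO: 90/91 (intervening sequence)": (117, 39),
    "SEQ ID NO: 92/93 (PPTase motif II)": (36, 12),
    "SEQ ID NO: 96/97 (xabC coding sequence)": (1029, 343),
    "SEQ ID NO: 104/105 (methyltransferase motifs)": (303, 101),
    "SEQ ID NO: 106/107 (biologically active fragments)": (831, 277),
}

for label, (bases, residues) in TABLE_PAIRS.items():
    print(label, "codon surplus:", codon_surplus(bases, residues))
```

Entries that include flanking non-coding sequence, such as the 1200-base SEQ ID NO: 82, are deliberately left out because their length is not expected to map directly onto a residue count.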
DETAILED DESCRIPTION OF THE INVENTION
1. Definitions

Unless defined otherwise, all technical and scientific terms used herein have the 
same meaning as commonly understood by those of ordinary skill in the art to which the 
invention belongs. Although any methods and materials similar or equivalent to those
described herein can be used in the practice or testing of the present invention, preferred 
methods and materials are described. For the purposes of the present invention, the 
following terms are defined below. 

The articles "a" and "an" are used herein to refer to one or to more than one (i.e.
to at least one) of the grammatical object of the article. By way of example, "an element"
means one element or more than one element. 

The term "about" is used herein to refer to sequences that vary by as much as 
30%, preferably by as much as 20% and more preferably by as much as 10% to the length 
of a reference sequence. 

By "agent" is meant a naturally occurring or synthetically produced molecule
which interacts either directly or indirectly with a target member, the level and/or
functional activity of which are to be modulated.

"Amplification product" refers to a nucleic acid product generated by nucleic acid 
amplification techniques. 

By "antigen-binding molecule" is meant a molecule that has binding affinity for a
target antigen. It will be understood that this term extends to immunoglobulins,
immunoglobulin fragments and non-immunoglobulin derived protein frameworks that 
exhibit antigen-binding activity. 

As used herein, the term "binds specifically" and the like refers to antigen-binding
molecules that bind the polypeptide or polypeptide fragments of the invention but
do not significantly bind to homologous prior art polypeptides.

By "biologically active fragment" is meant a fragment of a full-length parent
polypeptide which fragment retains the activity of the parent polypeptide. A biologically
active fragment will therefore comprise an activity selected from the group consisting of
acyl-CoA ligase activity, β-ketoacyl synthase activity, β-ketoacyl reductase, acyl carrier
protein activity, adenylation activity, peptidyl carrier protein activity, condensation
activity, PPTase activity and methyltransferase activity. As used herein, the term
"biologically active fragment" includes deletion mutants and small peptides, for example
of at least 10, preferably at least 20 and more preferably at least 30 contiguous amino
acids, which comprise the above activities. Peptides of this type may be obtained through
the application of standard recombinant nucleic acid techniques or synthesised using
conventional liquid or solid phase synthesis techniques. For example, reference may be
made to solution synthesis or solid phase synthesis as described, for example, in Chapter 9
entitled "Peptide Synthesis" by Atherton and Shephard which is included in a publication
entitled "Synthetic Vaccines" edited by Nicholson and published by Blackwell Scientific
Publications. Alternatively, peptides can be produced by digestion of a polypeptide of the
invention with proteinases such as endoLys-C, endoArg-C, endoGlu-C and staphylococcus
V8-protease. The digested fragments can be purified by, for example, high performance
liquid chromatographic (HPLC) techniques.
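
The proteinase digestion route mentioned above can be previewed in silico before any bench work. The sketch below is a simplified illustration only: it assumes the common textbook approximation that endoLys-C, endoArg-C and endoGlu-C cleave on the C-terminal side of Lys, Arg and Glu respectively, ignores missed cleavages and sequence-context effects, and uses an invented toy sequence. The function names and the minimum-length filter (echoing the 10-residue lower bound given for small peptides) are hypothetical.

```python
# Simplified in silico digestion; the cleavage rules are an assumed approximation.
CLEAVE_AFTER = {
    "endoLys-C": "K",  # cleaves after lysine
    "endoArg-C": "R",  # cleaves after arginine
    "endoGlu-C": "E",  # cleaves after glutamate
}

def digest(sequence, protease):
    """Split a one-letter amino acid sequence after each target residue."""
    target = CLEAVE_AFTER[protease]
    fragments = []
    start = 0
    for index, residue in enumerate(sequence):
        if residue == target:
            fragments.append(sequence[start:index + 1])
            start = index + 1
    if start < len(sequence):
        fragments.append(sequence[start:])  # C-terminal remainder
    return fragments

def long_enough(fragments, min_length=10):
    """Keep only fragments with at least min_length contiguous residues."""
    return [fragment for fragment in fragments if len(fragment) >= min_length]

toy_polypeptide = "MAGSTLLEVAGDSLQKWPEYTNNAAGGLLRSSTVVAAGK"  # invented example
for protease in CLEAVE_AFTER:
    print(protease, long_enough(digest(toy_polypeptide, protease)))
```

Whether any predicted fragment retains activity would still have to be established experimentally; the sketch only enumerates candidate peptides by length.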

Throughout this specification, unless the context requires otherwise, the words 
"comprise", "comprises" and "comprising" will be understood to imply the inclusion of a
stated step or element or group of steps or elements but not the exclusion of any other step 
or element or group of steps or elements. 

By "corresponds to" or "corresponding to" is meant a polynucleotide (a) having a
nucleotide sequence that is substantially identical or complementary to all or a portion of a
reference polynucleotide sequence or (b) encoding an amino acid sequence identical to an
amino acid sequence in a peptide or protein. This phrase also includes within its scope a 
peptide or polypeptide having an amino acid sequence that is substantially identical to a 
sequence of amino acids in a reference peptide or protein. 

By "derivative" is meant a polypeptide that has been derived from the basic 
sequence by modification, for example by conjugation or complexing with other chemical
moieties or by post-translational modification techniques as would be understood in the art. 



WO 02/24736 



PCT/AU01/01190 






The term "derivative" also includes within its scope alterations that have been made to a
parent sequence including additions, or deletions that provide for functionally equivalent
molecules. Accordingly, the term derivative encompasses molecules that will have an
activity selected from the group consisting of acyl-CoA ligase activity, β-ketoacyl synthase
activity, β-ketoacyl reductase, acyl carrier protein activity, adenylation activity, peptidyl
carrier protein activity, condensation activity, PPTase activity and methyltransferase
activity.

"Homology" refers to the percentage number of amino acids that are identical or
constitute conservative substitutions as defined in Table B infra. Homology may be
determined using sequence comparison programs such as GAP (Devereux et al. 1984,
Nucleic Acids Research 12, 387-395). In this way, sequences of a similar or substantially
different length to those cited herein might be compared by insertion of gaps into the
alignment, such gaps being determined, for example, by the comparison algorithm used by
GAP.

"Hybridisation" is used herein to denote the pairing of complementary nucleotide
sequences to produce a DNA-DNA hybrid or a DNA-RNA hybrid. Complementary base
sequences are those sequences that are related by the base-pairing rules. In DNA, A pairs
with T and C pairs with G. In RNA, U pairs with A and C pairs with G. In this regard, the
terms "match" and "mismatch" as used herein refer to the hybridisation potential of paired
nucleotides in complementary nucleic acid strands. Matched nucleotides hybridise
efficiently, such as the classical A-T and G-C base pairs mentioned above. Mismatches are
other combinations of nucleotides that do not hybridise efficiently.
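
The base-pairing rules just stated can be illustrated with a short sketch. It is an assumption-laden toy: the function names are hypothetical, the strands are invented, only the canonical pairs named above are treated as matches, and two equal-length DNA strands are compared in antiparallel orientation.

```python
# Canonical pairing rules as stated in the definition above.
DNA_PAIRS = {"A": "T", "T": "A", "C": "G", "G": "C"}
RNA_PAIRS = {"A": "U", "U": "A", "C": "G", "G": "C"}

def reverse_complement(strand, pairs=DNA_PAIRS):
    """Return the strand that pairs perfectly with the input, read 5' to 3'."""
    return "".join(pairs[base] for base in reversed(strand))

def count_mismatches(strand_a, strand_b):
    """Count mismatched positions when two DNA strands anneal antiparallel."""
    if len(strand_a) != len(strand_b):
        raise ValueError("this sketch only compares strands of equal length")
    return sum(
        1
        for base_a, base_b in zip(strand_a, reversed(strand_b))
        if DNA_PAIRS[base_a] != base_b
    )

probe = "ATGCCGTA"  # invented example
print(reverse_complement(probe))
print(count_mismatches(probe, reverse_complement(probe)))
```

A real hybridisation outcome also depends on stringency, strand length and base composition, none of which this counting exercise models.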

Reference herein to "immuno-interactive" includes reference to any interaction,
reaction, or other form of association between molecules and in particular where one of the
molecules is, or mimics, a component of the immune system.

By "immuno-interactive fragment" is meant a fragment of a parent or reference 
polypeptide as described herein, which fragment elicits an immune response, including the 
production of elements that specifically bind to said polypeptide, or variant or derivative 
thereof. As used herein, the term "immuno-interactive fragment" includes deletion mutants 
and small peptides, for example of at least six, preferably at least 8 and more preferably at
least 20 contiguous amino acids, which comprise antigenic determinants or epitopes.
Several such fragments may be joined together. 

By "isolated" is meant material that is substantially or essentially free from 
components that normally accompany it in its native state. For example, an "isolated 
polynucleotide", as used herein, refers to a polynucleotide, which has been purified from
the sequences which flank it in a naturally occurring state, e.g., a DNA fragment which has 
been removed from the sequences which are normally adjacent to the fragment. 

By "modulating" is meant increasing or decreasing, either directly or indirectly, 
the level and/or functional activity of a target molecule. For example, an agent may 
indirectly modulate the said level/activity by interacting with a molecule other than the
target molecule. In this regard, indirect modulation of a gene encoding a target polypeptide 
includes within its scope modulation of the expression of a first nucleic acid molecule, 
wherein an expression product of the first nucleic acid molecule modulates the expression 
of a nucleic acid molecule encoding the target polypeptide. 

By "obtained from" is meant that a sample such as, for example, a nucleic acid
extract or polypeptide extract is isolated from, or derived from, a particular source. For
example, the extract may be isolated directly from any organism that produces secondary 
metabolites, preferably from an albicidin-producing microorganism, more preferably from 
microorganisms of the genus Xanthomonas. 

The term "oligonucleotide" as used herein refers to a polymer composed of a
multiplicity of nucleotide units (deoxyribonucleotides or ribonucleotides, or related
structural variants or synthetic analogues thereof) linked via phosphodiester bonds (or
related structural variants or synthetic analogues thereof). Thus, while the term
"oligonucleotide" typically refers to a nucleotide polymer in which the nucleotides and
linkages between them are naturally occurring, it will be understood that the term also
includes within its scope various analogues including, but not restricted to, peptide nucleic
acids (PNAs), phosphoramidates, phosphorothioates, methyl phosphonates, 2-O-methyl
ribonucleic acids, and the like. The exact size of the molecule may vary depending on the
particular application. An oligonucleotide is typically rather short in length, generally from
about 10 to 30 nucleotides, but the term can refer to molecules of any length, although the
term "polynucleotide" or "nucleic acid" is typically used for large oligonucleotides.

By "operably linked" is meant that transcriptional and translational regulatory
nucleic acids are positioned relative to a polypeptide-encoding polynucleotide in such a
manner that the polynucleotide is transcribed and the polypeptide is translated.

The term "polynucleotide" or "nucleic acid" as used herein designates mRNA,
RNA, cRNA, cDNA or DNA. The term typically refers to oligonucleotides greater than 30
nucleotides in length. 

The terms "polynucleotide variant" and "variant" refer to polynucleotides 
displaying substantial sequence identity with a reference polynucleotide sequence or 
polynucleotides that hybridise with a reference sequence under stringent conditions that are
defined hereinafter. These terms also encompass polynucleotides in which one or more
nucleotides have been added or deleted, or replaced with different nucleotides. In this
regard, it is well understood in the art that certain alterations inclusive of mutations,
additions, deletions and substitutions can be made to a reference polynucleotide whereby
the altered polynucleotide retains the biological function or activity of the reference
polynucleotide. The terms "polynucleotide variant" and "variant" also include naturally
occurring allelic variants. 

"Polypeptide", "peptide" and "protein" are used interchangeably herein to refer to 
a polymer of amino acid residues and to variants and synthetic analogues of the same. 
Thus, these terms apply to amino acid polymers in which one or more amino acid residues 
is a synthetic non-naturally occurring amino acid, such as a chemical analogue of a
corresponding naturally occurring amino acid, as well as to naturally-occurring amino acid 
polymers. 

The term "polypeptide variant" refers to polypeptides in which one or more
amino acids have been replaced by different amino acids. It is well understood in the art
that some amino acids may be changed to others with broadly similar properties without
changing the nature of the activity of the polypeptide (conservative substitutions) as
described hereinafter. These terms also encompass polypeptides in which one or more
amino acids have been added or deleted, or replaced with different amino acids.
Accordingly, polypeptide variants as used herein encompass polypeptides that have an
activity selected from the group consisting of acyl-CoA ligase activity, β-ketoacyl synthase
activity, β-ketoacyl reductase, acyl carrier protein activity, adenylation activity, peptidyl
carrier protein activity, condensation activity, PPTase activity and methyltransferase
activity.

By "primer" is meant an oligonucleotide which, when paired with a strand of
DNA, is capable of initiating the synthesis of a primer extension product in the presence of
a suitable polymerising agent. The primer is preferably single-stranded for maximum
efficiency in amplification but may alternatively be double-stranded. A primer must be
sufficiently long to prime the synthesis of extension products in the presence of the
polymerisation agent. The length of the primer depends on many factors, including
application, temperature to be employed, template reaction conditions, other reagents, and
source of primers. For example, depending on the complexity of the target sequence, the
oligonucleotide primer typically contains 15 to 35 or more nucleotides, although it may
contain fewer nucleotides. Primers can be large polynucleotides, such as from about 200
nucleotides to several kilobases or more. Primers may be selected to be "substantially
complementary" to the sequence on the template to which they are designed to hybridise and
serve as a site for the initiation of synthesis. By "substantially complementary", it is meant
that the primer is sufficiently complementary to hybridise with a target nucleotide
sequence. Preferably, the primer contains no mismatches with the template to which it is
designed to hybridise but this is not essential. For example, non-complementary
nucleotides may be attached to the 5' end of the primer, with the remainder of the primer
sequence being complementary to the template. Alternatively, non-complementary
nucleotides or a stretch of non-complementary nucleotides can be interspersed into a
primer, provided that the primer sequence has sufficient complementarity with the
sequence of the template to hybridise therewith and thereby form a template for synthesis
of the extension product of the primer.

"Probe" refers to a molecule that binds to a specific sequence or sub-sequence or
other moiety of another molecule. Unless otherwise indicated, the term "probe" typically
refers to a polynucleotide probe that binds to another nucleic acid, often called the "target
nucleic acid", through complementary base pairing. Probes may bind target nucleic acids
lacking complete sequence complementarity with the probe, depending on the stringency
of the hybridisation conditions. Probes can be labelled directly or indirectly.

The term "recombinant polynucleotide" as used herein refers to a polynucleotide
formed in vitro by the manipulation of nucleic acid into a form not normally found in
nature. For example, the recombinant polynucleotide may be in the form of an expression
vector. Generally, such expression vectors include transcriptional and translational
regulatory nucleic acid operably linked to the nucleotide sequence.

By "recombinant polypeptide" is meant a polypeptide made using recombinant 
techniques, i.e., through the expression of a recombinant polynucleotide. 

By "reporter molecule" as used in the present specification is meant a molecule 
that, by its chemical nature, provides an analytically identifiable signal that allows the 
detection of a complex comprising an antigen-binding molecule and its target antigen. The
term "reporter molecule" also extends to use of cell agglutination or inhibition of 
agglutination such as red blood cells on latex beads, and the like.

Terms used to describe sequence relationships between two or more
polynucleotides or polypeptides include "reference sequence", "comparison window",
"sequence identity", "percentage of sequence identity" and "substantial identity". A
"reference sequence" is at least 12 but frequently 15 to 18 and often at least 25 monomer
units, inclusive of nucleotides and amino acid residues, in length. Because two
polynucleotides may each comprise (1) a sequence (i.e., only a portion of the complete
polynucleotide sequence) that is similar between the two polynucleotides, and (2) a
sequence that is divergent between the two polynucleotides, sequence comparisons
between two (or more) polynucleotides are typically performed by comparing sequences of
the two polynucleotides over a "comparison window" to identify and compare local
regions of sequence similarity. A "comparison window" refers to a conceptual segment of
at least 6 contiguous positions, usually about 50 to about 100, more usually about 100 to
about 150 in which a sequence is compared to a reference sequence of the same number of
contiguous positions after the two sequences are optimally aligned. The comparison
window may comprise additions or deletions (i.e., gaps) of about 20% or less as compared
to the reference sequence (which does not comprise additions or deletions) for optimal
alignment of the two sequences. Optimal alignment of sequences for aligning a comparison
window may be conducted by computerised implementations of algorithms (GAP,
BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics Software Package Release
7.0, Genetics Computer Group, 575 Science Drive Madison, WI, USA) or by inspection
and the best alignment (i.e., resulting in the highest percentage homology over the
comparison window) generated by any of the various methods selected. Reference also
may be made to the BLAST family of programs as for example disclosed by Altschul et
al., 1997, Nucl. Acids Res. 25:3389. A detailed discussion of sequence analysis can be
found in Unit 19.3 of Ausubel et al., "Current Protocols in Molecular Biology", John
Wiley & Sons Inc, 1994-1998, Chapter 15.

The term "sequence identity" as used herein refers to the extent that sequences
are identical on a nucleotide-by-nucleotide basis or an amino acid-by-amino acid basis
over a window of comparison. Thus, a "percentage of sequence identity" is calculated by
comparing two optimally aligned sequences over the window of comparison, determining
the number of positions at which the identical nucleic acid base (e.g., A, T, C, G, I) or the
identical amino acid residue (e.g., Ala, Pro, Ser, Thr, Gly, Val, Leu, Ile, Phe, Tyr, Trp, Lys,
Arg, His, Asp, Glu, Asn, Gln, Cys and Met) occurs in both sequences to yield the number
of matched positions, dividing the number of matched positions by the total number of
positions in the window of comparison (i.e., the window size), and multiplying the result
by 100 to yield the percentage of sequence identity. For the purposes of the present
invention, "sequence identity" will be understood to mean the "match percentage"
calculated by the DNASIS computer program (Version 2.5 for Windows; available from
Hitachi Software Engineering Co., Ltd., South San Francisco, California, USA) using
standard defaults as used in the reference manual accompanying the software.
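
The arithmetic in the definition above (matched positions divided by window size, multiplied by 100) can be written out as a short sketch. This is not the DNASIS algorithm, whose internals are not described here; it assumes the two sequences have already been optimally aligned to equal length, with "-" as a hypothetical gap symbol that never counts as a match.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percentage of window positions carrying the identical base or residue."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must span the same window")
    if not aligned_a:
        raise ValueError("comparison window is empty")
    matched = sum(
        1
        for symbol_a, symbol_b in zip(aligned_a, aligned_b)
        if symbol_a == symbol_b and symbol_a != gap
    )
    return 100.0 * matched / len(aligned_a)

# Invented aligned windows for illustration.
print(percent_identity("ACGTACGT", "ACGTTCGT"))
print(percent_identity("MKT-LLA", "MKTALLA"))
```

Because the result depends on how the alignment and gaps were produced, values from this sketch are only comparable with program output when the same alignment is used.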

"Stringency" as used herein, refers to the temperature and ionic strength 
conditions, and presence or absence of certain organic solvents, during hybridisation and 
washing procedures. The higher the stringency, the higher will be the degree of 
complementarity between immobilised target nucleotide sequences and the labelled probe
polynucleotide sequences that remain hybridised to the target after washing. 

"Stringent conditions" refers to temperature and ionic conditions under which 
only nucleotide sequences having a high frequency of complementary bases will hybridise.
The stringency required is nucleotide sequence dependent and depends upon the various
components present during hybridisation and subsequent washes, and the time allowed for
these processes. Generally, in order to maximise the hybridisation rate, non-stringent
hybridisation conditions are selected; about 20 to 25 °C lower than the thermal melting
point (Tm). The Tm is the temperature at which 50% of specific target sequence hybridises
to a perfectly complementary probe in solution at a defined ionic strength and pH.
Generally, in order to require at least about 85% nucleotide complementarity of hybridised
sequences, highly stringent washing conditions are selected to be about 5 to 15 °C lower
than the Tm. In order to require at least about 70% nucleotide complementarity of
hybridised sequences, moderately stringent washing conditions are selected to be about 15
to 30 °C lower than the Tm. Highly permissive (low stringency) washing conditions may be
as low as 50 °C below the Tm, allowing a high level of mis-matching between hybridised
sequences. Those skilled in the art will recognise that other physical and chemical
parameters in the hybridisation and wash stages can also be altered to affect the outcome of
a detectable hybridisation signal from a specific level of homology between target and
probe sequences. Other examples of stringency conditions are described in section 3.3.
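
The temperature offsets quoted above translate directly into working ranges once a Tm is known. The sketch below simply applies those offsets; the dictionary keys, function names and the example Tm of 80 °C are illustrative assumptions, and no Tm estimation formula is implied.

```python
# Offsets below Tm (in degrees C) as quoted in the passage above.
OFFSETS_BELOW_TM_C = {
    "hybridisation": (20, 25),
    "high_stringency_wash": (5, 15),
    "moderate_stringency_wash": (15, 30),
}

def temperature_window(tm_c, condition):
    """Return (lowest, highest) temperature in degrees C for a condition."""
    smaller_offset, larger_offset = OFFSETS_BELOW_TM_C[condition]
    return (tm_c - larger_offset, tm_c - smaller_offset)

def low_stringency_floor(tm_c):
    """Lowest wash temperature mentioned for highly permissive conditions."""
    return tm_c - 50

example_tm = 80.0  # assumed value for illustration only
for condition in OFFSETS_BELOW_TM_C:
    print(condition, temperature_window(example_tm, condition))
print("low stringency floor", low_stringency_floor(example_tm))
```

In practice the Tm itself shifts with ionic strength, probe length and organic solvents, so the windows are a starting point to be refined empirically.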

By "vector" is meant a nucleic acid molecule, preferably a DNA molecule
derived, for example, from a plasmid, bacteriophage, or plant virus, into which a nucleic
acid sequence may be inserted or cloned. A vector preferably contains one or more unique
restriction sites and may be capable of autonomous replication in a defined host cell
including a target cell or tissue or a progenitor cell or tissue thereof, or be integrable with
the genome of the defined host such that the cloned sequence is reproducible. Accordingly,
the vector may be an autonomously replicating vector, i.e., a vector that exists as an
extrachromosomal entity, the replication of which is independent of chromosomal
replication, e.g., a linear or closed circular plasmid, an extrachromosomal element, a
minichromosome, or an artificial chromosome. The vector may contain any means for
assuring self-replication. Alternatively, the vector may be one which, when introduced into
a cell, is integrated into the genome of the recipient cell and replicated together with the
chromosome(s) into which it has been integrated. A vector system may comprise a single
vector or plasmid, two or more vectors or plasmids, which together contain the total DNA
to be introduced into the genome of the host cell, or a transposon. The choice of the vector
will typically depend on the compatibility of the vector with the cell into which the vector
is to be introduced. The vector may also include a selection marker such as an antibiotic
resistance gene that can be used for selection of suitable transformants. Examples of such
resistance genes are well known to those of skill in the art.

As used herein, underscoring or italicising the name of a gene shall indicate the
gene, in contrast to its protein product, which is indicated by the name of the gene in the
absence of any underscoring or italicising. For example, "xabB" shall mean the xabB gene,
whereas "XabB" shall indicate the protein product of the "xabB" gene.

2. Isolated polypeptides, biologically active fragments, polypeptide variants and
derivatives

2.1 Polypeptides of the invention 

2.1.1 Albicidin synthetase 

The present inventor has also isolated a gene (xabB) encoding a large modular
polyketide synthase (PKS) linked to a non-ribosomal peptide synthetase (NRPS) (predicted
Mr 525,695). At 4801 amino acids in length, the product of xabB (XabB) is the largest
reported PKS-NRPS. Comparison of XabB with available protein sequence databases
reveals an N-terminal region (from Met-1 to Asp-3235) similar to many microbial modular
PKSs, and a C-terminal region (from Pro-3236 to Asp-4801) similar to NRPSs.
Recognisable PKS domains, commencing at the N-terminus of XabB, are an acyl-CoA
ligase (AL), acyl carrier protein (ACP1), β-ketoacyl synthase (KS1), and β-ketoacyl
reductase (KR), followed by two consecutive ACPs and one KS (Figure 1). The motifs
characteristic of these domains are aligned with those from other organisms in Figure 3.
The AL domain shows 22-30% identity and 50-60% similarity to prokaryotic and
eukaryotic aromatic acid-CoA ligases and long-chain fatty acid-CoA ligases, and contains
the conserved adenylation core sequence (SGSSG) and the ATPase motif (TGD). The
three ACP domains show up to 39.2% identity and 78.6% similarity to acyl carrier
proteins, and all contain a 4'-phosphopantetheinyl binding cofactor box GxDS(I/L)
(Hopwood and Sherman, 1990), except that A replaces G in ACP1 (Figure 3). The two KS
domains show up to 56.1% identity and 80.8% similarity to β-ketoacyl synthases. Both
contain the motif GPxxxxxxxCSxSL around the active site Cys, and two His residues
downstream of the active site Cys, in motifs characteristic of these enzymes (Donadio et
al., 1991; Hopwood, 1997; Huang et al., 1998). The KR domain shows up to 27.9%
identity and 61.8% similarity to β-ketoacyl reductases, and contains the NAD(P)H binding
site GGxGxLG (Scrutton et al., 1990).

At the C-terminus of XabB is an apparent peptide synthetase region linked to the
PKS module via a peptidyl carrier protein (PCP) domain (Figure 1). The peptide synthetase
region shows 31-38% identity and 60-63% similarity with members of the peptide
synthetase family. It displays the ordered condensation, adenylation, and PCP domains
typical of such multienzymes (Marahiel et al., 1997) followed by an extra condensation
domain. The conserved sequences, characteristic of the domains commonly found in
peptide synthetases, are compared with those from XabB in Table 2.
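
The consensus motifs quoted above for the ACP, KS and KR domains lend themselves to a simple pattern search. The sketch below is illustrative only: it reads "x" as any single residue and the cofactor box as GxDS followed by Ile or Leu (an interpretation of the text), the dictionary labels and function name are hypothetical, and the toy sequence is invented rather than taken from XabB.

```python
import re

# Consensus motifs from the text, with "x" rendered as "." (any residue).
MOTIF_PATTERNS = {
    "ACP cofactor box": re.compile(r"G.DS[IL]"),
    "KS active-site motif": re.compile(r"GP.{7}CS.SL"),
    "KR NAD(P)H binding site": re.compile(r"GG.G.LG"),
}

def find_motifs(sequence):
    """Return (motif name, start, end, matched text) using 1-based residues."""
    hits = []
    for name, pattern in MOTIF_PATTERNS.items():
        for match in pattern.finditer(sequence):
            hits.append((name, match.start() + 1, match.end(), match.group()))
    return hits

# Invented toy sequence containing one instance of each motif.
TOY_SEQUENCE = "MKT" + "GADSL" + "AAA" + "GPAAAAAAACSASL" + "TT" + "GGAGALG"
for hit in find_motifs(TOY_SEQUENCE):
    print(hit)
```

A strict pattern of this kind would miss variants such as the ACP1 box in which A replaces G, so a profile-based search is usually preferred for real sequences.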

In more detail, the full-length amino acid sequence of the X. albilineans PKS-NRPS,
presented in SEQ ID NO: 2, extends 4801 residues and includes the following
sequence signature motifs:

(a) acyl-CoA ligase (AL) motif I extending from about residue 226 to about residue
240, and motif II extending from about residue 486 to about residue 493;

(b) β-ketoacyl synthase 1 (KS1) motif I extending from about residue 897 to about
residue 913, motif II extending from about residue 1038 to about residue 1047, and
motif III extending from about residue 1080 to about residue 1089;

(c) β-ketoacyl synthase 2 (KS2) motif I extending from about residue 2777 to about
residue 2793, motif II extending from about residue 2918 to about residue 2927, and
motif III extending from about residue 2955 to about residue 2964;

(d) β-ketoacyl reductase (KR) motif extending from about residue 1812 to about
residue 1842;

(e) acyl carrier protein 1 (ACP1) motif extending from about residue 667 to about
residue 678;

(f) acyl carrier protein 2 (ACP2) motif extending from about residue 2484 to about
residue 2495;

(g) acyl carrier protein 3 (ACP3) motif extending from about residue 2568 to about
residue 2579;

(h) adenylation domain (A) motif I extending from about residue 3806 to about
residue 3811, motif II extending from about residue 3851 to about residue 3861, motif
III extending from about residue 3917 to about residue 3932, motif IV extending from
about residue 3967 to about residue 3970, motif V extending from about residue 4063 to
about residue 4069, motif VI extending from about residue 4114 to about residue 4128,
motif VII extending from about residue 4152 to about residue 4157, motif VIII
extending from about residue 4170 to about residue 4189, motif IX extending from
about residue 4239 to about residue 4245, and motif X extending from about residue
4259 to about residue 4264;

(i) peptidyl carrier protein 1 (PCP1) motif extending from about residue 3261 to
about residue 3271;

(j) peptidyl carrier protein 2 (PCP2) motif extending from about residue 4306 to
about residue 4316;

(k) condensation domain 1 (C1) motif I extending from about residue 3333 to about
residue 3342, motif II extending from about residue 3381 to about residue 3389,
motif III extending from about residue 3456 to about residue 3465, motif IV extending
from about residue 3495 to about residue 3501, motif V extending from about residue
3606 to about residue 3617, motif VI extending from about residue 3641 to about
residue 3647, and motif VII extending from about residue 3658 to about residue 3665; and

(l) condensation domain 2 (C2) motif I extending from about residue 4374 to about
residue 4383, motif II extending from about residue 4421 to about residue 4429,
motif III extending from about residue 4498 to about residue 4507, motif IV extending
from about residue 4538 to about residue 4544, motif V extending from about residue
4649 to about residue 4659, motif VI extending from about residue 4685 to about
residue 4691, and motif VII extending from about residue 4701 to about residue 4708.

From the above signature motifs, it can be deduced that XabB commences with an
AL domain (residues 1-629) followed by an ACP domain (ACP1, residues 630-731). In
other PKS systems, an N-terminal AL is involved in activation and incorporation of
3,4-dihydroxycyclohexane carboxylic acid, 3-amino-5-hydroxy benzoic acid (AHBA), or
long-chain fatty acid as a starter (Aparicio et al., 1996; Motamedi and Shafiee, 1998; Tang et
al., 1998; Duitman et al., 1999). The second module in XabB contains a KS (residues 732-
1165), and a KR (residues 1811-1971) upstream of two ACPs (residues 2457-2522, 2544-
2613), but lacks any discernable AT domain (Figure 1). The third module contains a KS
(residues 2630-3046) followed by a PCP (residues 3221-3307) at the start of the XabB
NRPS region.
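
The domain boundaries given in this paragraph can be checked against the signature motif positions listed earlier for the PKS region and PCP1. The following sketch is a consistency check only; the residue ranges are copied from the text, while the variable and function names are hypothetical.

```python
# Domain boundaries (residue numbers) as stated in the paragraph above.
DOMAINS = [
    ("AL", 1, 629),
    ("ACP1", 630, 731),
    ("KS1", 732, 1165),
    ("KR", 1811, 1971),
    ("ACP2", 2457, 2522),
    ("ACP3", 2544, 2613),
    ("KS2", 2630, 3046),
    ("PCP1", 3221, 3307),
]

# Signature motif spans as listed for SEQ ID NO: 2 (approximate positions).
MOTIF_SPANS = [
    ("AL", 226, 240), ("AL", 486, 493),
    ("ACP1", 667, 678),
    ("KS1", 897, 913), ("KS1", 1038, 1047), ("KS1", 1080, 1089),
    ("KR", 1812, 1842),
    ("ACP2", 2484, 2495), ("ACP3", 2568, 2579),
    ("KS2", 2777, 2793), ("KS2", 2918, 2927), ("KS2", 2955, 2964),
    ("PCP1", 3261, 3271),
]

def domain_containing(start, end):
    """Return the name of the domain that fully contains the span, if any."""
    for name, domain_start, domain_end in DOMAINS:
        if domain_start <= start and end <= domain_end:
            return name
    return None

def misplaced_motifs():
    """List motif spans that do not fall inside their named domain."""
    return [
        (name, start, end)
        for name, start, end in MOTIF_SPANS
        if domain_containing(start, end) != name
    ]

print("misplaced motifs:", misplaced_motifs())
```

Boundaries for the adenylation and condensation domains are not stated in this paragraph, so their motifs are omitted from the check.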

Four other fused PKS/NRPS systems (Albertini et al., 1995; Gehring et al., 1998;
Duitman et al., 1999; Paitan et al., 1999) are known, three of which lack recognisable AT
domains (Figure 6). Yersinia pestis HMWP1 contains a typical PKS elongation module
(including AT), and an NRPS module with a terminating TE domain. It is the third protein,
following an AL (YbtE) and NRPS (HMWP2) in the biosynthetic apparatus for
yersiniabactin (Gehring et al., 1998). B. subtilis MycA bears the closest resemblance to
XabB, showing PKS initiation and elongation modules linked via an amino transferase
(AMT) domain to the NRPS region. In B. subtilis PksK and M. xanthus Tal, the NRPS
region precedes the PKS region. Separate AT enzymes encoded elsewhere in the genome
may operate in trans to load the appropriate acyl groups onto the ACPs in the elongation
modules of these PKSs. Candidates are a malonyl-CoA transacylase gene (fenF) located
immediately upstream of mycA (Duitman et al., 1999), and an acyltransferase gene located
20 kb upstream of tal (Paitan et al., 1999). Accordingly, it is believed that one or more
such trans-acting AT enzymes may also be involved in connection with the operation of
XabB.

From the characteristics of albicidin, and the architecture of the XabB PKS region
(Figure 1), the inventor considers that: (i) the AL couples coenzyme A to a
shikimate-derived acyl residue in an ATP-dependent reaction, and loads the activated acyl
unit onto the 4'-phosphopantetheine prosthetic arm of ACP1; (ii) an acyl group is loaded
onto ACP2 or ACP3 by a separate acyltransferase; (iii) the KS1 domain accepts the acyl
residue from ACP1 onto a conserved cysteine residue, then transfers it by decarboxylative
condensation onto the acyl group tethered to ACP2 or ACP3; (iv) the tethered chain is
modified by KR; and (v) the assembled polyketide intermediate is translocated via KS2
onto the 4'-phosphopantetheine prosthetic arm of PCP1, at the start of the NRPS region.

The A domain in the NRPS region of XabB contains ten conserved sequences (A1
to A10, Table 2) identified as AMP, ATP-Mg binding, adenine binding or ATPase sites
(Turgay et al., 1992; Marahiel et al., 1997). In other NRPS systems, A domains select and
load a particular amino acid, nonproteinogenic amino, hydroxyl or carboxy acid (Marahiel
et al., 1997). Substrate specificity is determined at the binding pocket, consisting of a
stretch of about 100 amino acid residues between the highly conserved motifs A4 and A5
(Conti et al., 1997). Sequence alignments for this region reveal some clusters
corresponding with the loaded substrate (Stachelhaus et al., 1999). The A domain from
XabB falls in a diverse cluster of NRPS modules involved in loading of His, Leu or
aromatic amino acids (Phe and Tyr) in other NRPS systems (Figure 7). Based on the
architecture of the XabB NRPS region, it can be inferred that the polyketide intermediate
tethered on PCP1 is accepted by C1 and coupled to the amino, hydroxyl, or carboxy acid
preloaded by A onto PCP2. The final condensation domain at the C-terminus of XabB is
probably involved in peptide-chain termination and cyclisation, as in enniatin, HC-toxin,
rapamycin and FK506 systems (Konz and Marahiel, 1999).

2.1.2 Phosphopantetheinyl transferase associated with albicidin biosynthesis

The present invention also provides a gene (xabA) from X. albilineans encoding a
phosphopantetheinyl transferase (PPTase) associated with XabB function. In this regard,
XabB contains five carrier protein (ACP/PCP) domains, to which the growing polyketide
or polypeptide chain could be covalently tethered. Each functional ACP or PCP domain
must have a specific serine side chain phosphopantetheinylated by a dedicated PPTase
(Lambalot et al., 1996). The product of xabA (XabA) fulfils this function and is required
for post-translational activation of synthetases in the albicidin biosynthetic pathway.

The full-length amino acid sequence of this X. albilineans PPTase, presented in
SEQ ID NO: 83, extends 278 residues and includes the sequence signature motifs for
PPTases which are located as follows: (I) motif I spanning from about residue 159 to about
residue 167; and (II) motif II spanning from about residue 207 to about residue 218, of
SEQ ID NO: 83. The sequence intervening between the two motifs extends from about
residue 168 to about residue 206 of SEQ ID NO: 83. These conserved sequence motifs and
the intervening sequence are presented for convenience in SEQ ID NO: 89, 93 and 91,
respectively.
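
The residue coordinates given above are 1-based and inclusive, so that motif I, the intervening sequence and motif II together tile residues 159 to 218 without overlap. A minimal Python sketch of extracting such regions from a full-length sequence follows. It is illustrative only: the function name and the placeholder sequence are hypothetical, and the placeholder is not SEQ ID NO: 83.

```python
def subsequence(seq: str, start: int, end: int) -> str:
    """Return residues start..end of seq, using 1-based inclusive coordinates."""
    if not (1 <= start <= end <= len(seq)):
        raise ValueError(f"range {start}-{end} lies outside a {len(seq)}-residue sequence")
    return seq[start - 1:end]


# Approximate region boundaries as quoted in the text for the PPTase.
PPTASE_REGIONS = {
    "motif I": (159, 167),
    "intervening sequence": (168, 206),
    "motif II": (207, 218),
}

if __name__ == "__main__":
    # Hypothetical 278-residue placeholder; substitute the real sequence.
    placeholder = "".join("ACDEFGHIKLMNPQRSTVWY"[i % 20] for i in range(278))
    for name, (start, end) in PPTASE_REGIONS.items():
        region = subsequence(placeholder, start, end)
        print(f"{name}: residues {start}-{end}, {len(region)} residues")
```
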

The deduced xabA gene product has 56-62% overall similarity to EntD proteins
for enterobactin biosynthesis and 39-56% overall similarity to other enzymes in the
phosphopantetheinyl transferase superfamily. Like entD, xabA includes rarely used codons,
which may impose post-transcriptional control on the rate of gene product formation
(Coderre & Earhart, 1989). Codon optimisation of xabA may, therefore, be useful for
enhancing the production of XabA.



2.1.3 Methyltransferase associated with albicidin biosynthesis 

The invention also provides a gene (xabC) from X. albilineans encoding a
methyltransferase enzyme, more particularly an O-methyltransferase enzyme, which is
required for albicidin production and which when expressed above natural levels leads to
increased levels and/or functional activities of albicidin antibiotics. The full-length amino
acid sequence of this X. albilineans methyltransferase, presented in SEQ ID NO: 95,
extends 343 residues and includes methyltransferase consensus sequence motifs which are
located as follows: (I) motif I spanning from about residue 173 to about residue 180; (II)
motif II spanning from about residue 236 to about residue 243; and (III) motif III spanning
from about residue 266 to about residue 274, of SEQ ID NO: 95. These conserved
sequence motifs are presented for convenience in SEQ ID NO: 99, 101 and 103,
respectively.

2.2 Biologically active fragments

The invention also contemplates biologically active fragments of the above polypeptides
of at least 6 and preferably at least 8 amino acids in length, which comprise an activity
associated with the domains described above. Biologically active fragments
may be produced according to any suitable procedure known in the art. For example, a
suitable method may include first producing a fragment of a parent polypeptide as
described in Section 2.1 and then testing the fragment for the appropriate biological
activity. In one embodiment, the fragment is derived from the albicidin PKS-NRPS of the
invention and is tested for an activity selected from the group consisting of acyl-CoA
ligase activity, β-ketoacyl synthase activity, β-ketoacyl reductase activity, acyl carrier
protein activity, adenylation activity, peptidyl carrier protein activity and condensation
activity.

Any assay that detects or preferably measures such activities is contemplated in
the practice of the present invention. The biologically active fragment suitably comprises
any one or more of the sequence signature motifs described above, or variants thereof.
Preferably, the biologically active fragment comprises all said sequence signature motifs,
or variants thereof.

In another embodiment, the fragment is derived from the PPTase of the invention
and is tested for PPTase activity according to standard assays known to persons of skill in
the art. Suitably, the PPTase catalyses the pantetheinylation, more preferably the
phosphopantetheinylation, of proteins involved in antibiotic biosynthesis, preferably
albicidin biosynthesis. The biologically active fragment preferably comprises the
consensus sequence motifs set forth in SEQ ID NO: 89 and 93, or variants thereof, and thus
more preferably comprises the sequence from about residue 159 to about residue 218 of
SEQ ID NO: 83.

In yet another embodiment, the fragment is derived from the methyltransferase of
the invention and is tested for methyltransferase activity, preferably O-methyltransferase
activity and more preferably S-adenosylmethionine-dependent O-methyltransferase
activity. Suitably, the methyltransferase catalyses the transfer of one or more methyl
groups to an antibiotic precursor, more preferably an albicidin precursor or an intermediate
relating to the biosynthesis of albicidins. The biologically active fragment preferably
comprises the consensus sequence motifs set forth in SEQ ID NO: 99, 101 and 103, or
variants thereof, and thus more preferably comprises the sequence from about residue 173
to about residue 274 of SEQ ID NO: 95 (i.e., SEQ ID NO: 105), or a variant of said
sequence. In an especially preferred embodiment, the biologically active fragment
comprises the sequence from about residue 1 to about residue 277 of SEQ ID NO: 95 (i.e.,
SEQ ID NO: 107), or a variant of said sequence. An exemplary polynucleotide encoding this
sequence is cloned in vector pLXABB described infra.

Alternatively, biological activity of the fragment is tested by introducing a
polynucleotide from which a fragment of a parent polypeptide can be translated into a cell,
and detecting one or more of the above activities, which is indicative of said fragment
being a biologically active fragment. In one embodiment, such activity can be assayed by
introducing into an albicidin-deficient xabB X. albilineans mutant (e.g., strain LS157
described herein) a polynucleotide from which a PKS-NRPS-associated fragment can be
produced and assaying for antibiotic activity using a microbial plate assay, as for instance
described in Example 1.

In another embodiment, PPTase activity is assayed by introducing
into an albicidin-deficient xabA X. albilineans mutant (e.g., strain LS156 described herein)
a polynucleotide from which a PPTase-associated fragment can be produced and assaying
for antibiotic activity using a microbial plate assay, as for instance described in Example 2.

In yet another embodiment, methyltransferase activity is assayed by introducing
into an albicidin-deficient xabC X. albilineans mutant (e.g., strain LS-JP1 described
herein) a polynucleotide from which a methyltransferase-associated fragment can be
produced and assaying for antibiotic activity using a microbial plate assay, as for
instance described in Example 3.



2.3 Polypeptide variants 

The invention also contemplates polypeptide variants of the polypeptides of the
invention wherein said variants have an activity selected from the group consisting of
acyl-CoA ligase activity, β-ketoacyl synthase activity, β-ketoacyl reductase activity, acyl
carrier protein activity, adenylation activity, peptidyl carrier protein activity, condensation
activity, PPTase activity, and methyltransferase activity. Suitable methods of producing
polypeptide variants include, for example, producing a modified polypeptide whose
sequence is distinguished from a parent polypeptide as described in Section 2.1 or a
biologically active fragment thereof by the substitution, deletion and/or addition of at
least one amino acid. The modified polypeptide is then tested for one or more of said
activities, wherein the presence of that activity indicates that the modified polypeptide is
a variant of the parent polypeptide.

In another embodiment, a polypeptide variant is produced by introducing into a
cell a polynucleotide from which a modified polypeptide can be translated, and detecting
one or more of the activities described above that are associated with the cell, which is
indicative of the modified polypeptide being a polypeptide variant.

In general, variants will have at least 60%, more suitably at least 70%, preferably
at least 80%, and more preferably at least 90% homology to a polypeptide as for example
shown in SEQ ID NO: 4, or a biological fragment thereof. It is preferred that variants
display at least 60%, more suitably at least 70%, preferably at least 75%, more preferably
at least 80%, more preferably at least 85%, more preferably at least 90% and still more
preferably at least 95% sequence identity with a parent polypeptide as described in Section
2.1 or a biologically active fragment thereof. In this respect, the window of comparison
preferably spans about the full length of the polypeptide or of the biologically active
fragment. Suitable variants can be obtained from any secondary metabolite-producing
organism, and preferably from an albicidin-producing organism.
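
For illustration, percentage sequence identity over a window of comparison can be computed from two sequences that have already been aligned. The Python sketch below is a simplified example under stated assumptions and does not reproduce any particular alignment program: the function name is hypothetical, gaps are written as "-", and the denominator is taken to be the number of alignment columns in which at least one sequence has a residue. Other conventions for the denominator exist and give somewhat different values.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity between two pre-aligned sequences of equal length.

    Assumption: '-' marks a gap, and columns that are gaps in both
    sequences are ignored when counting the window of comparison.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    columns = [(a, b) for a, b in zip(aligned_a, aligned_b)
               if not (a == "-" and b == "-")]
    if not columns:
        raise ValueError("no comparable positions in the alignment")
    matches = sum(1 for a, b in columns if a == b and a != "-")
    return 100.0 * matches / len(columns)


if __name__ == "__main__":
    # Hypothetical aligned peptides: one substitution and one gap in ten columns.
    parent = "MKTAYL-KQR"
    variant = "MKSAYLAKQR"
    print(f"{percent_identity(parent, variant):.1f}% identity")
```
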

Alternatively, polypeptide variants according to the invention can be identified
either rationally, or via established methods of mutagenesis (see, for example, Watson, J.
D. et al., "MOLECULAR BIOLOGY OF THE GENE", Fourth Edition,
Benjamin/Cummings, Menlo Park, Calif., 1987). Significantly, a random mutagenesis
approach requires no a priori information about the gene sequence that is to be mutated.
This approach has the advantage that it assesses the desirability of a particular mutant
based on its function, and thus does not require an understanding of how or why the
resultant mutant protein has adopted a particular conformation. Indeed, the random
mutation of target gene sequences has been one approach used to obtain mutant proteins
having desired characteristics (Leatherbarrow, R. 1986, J. Prot. Eng. 1: 7-16; Knowles, J.
R., 1987, Science 236: 1252-1258; Shaw, W. V., 1987, Biochem. J. 246: 1-17; Gerlt, J. A.
1987, Chem. Rev. 87: 1079-1105). Alternatively, where a particular sequence alteration is
desired, methods of site-directed mutagenesis can be employed. Thus, such methods may
be used to selectively alter only those amino acids of the protein that are believed to be
important (Craik, C. S., 1985, Science 228: 291-297; Cronin, et al., 1988, Biochem. 27:
4572-4579; Wilks, et al., 1988, Science 242: 1541-1544).

Variant peptides or polypeptides, resulting from rational or established methods of
mutagenesis or from combinatorial chemistries, may comprise conservative amino acid
substitutions. Exemplary conservative substitutions in a polypeptide or polypeptide
fragment according to the invention may be made according to the following table:

TABLE B

Original Residue      Exemplary Substitutions

Ala                   Ser
Arg                   Lys
Asn                   Gln, His
Asp                   Glu
Cys                   Ser
Gln                   Asn
Glu                   Asp
Gly                   Pro
His                   Asn, Gln
Ile                   Leu, Val
Leu                   Ile, Val
Lys                   Arg, Gln, Glu
Met                   Leu, Ile
Phe                   Met, Leu, Tyr
Ser                   Thr
Thr                   Ser
Trp                   Tyr
Tyr                   Trp, Phe
Val                   Ile, Leu


Substantial changes in function are made by selecting substitutions that are less
conservative than those shown in TABLE B. Other replacements would be
non-conservative substitutions and relatively fewer of these may be tolerated. Generally, the
substitutions which are likely to produce the greatest changes in a polypeptide's properties
are those in which (a) a hydrophilic residue (e.g., Ser or Asn) is substituted for, or by, a
hydrophobic residue (e.g., Ala, Leu, Ile, Phe or Val); (b) a cysteine or proline is substituted
for, or by, any other residue; (c) a residue having an electropositive side chain (e.g., Arg,
His or Lys) is substituted for, or by, an electronegative residue (e.g., Glu or Asp); or (d) a
residue having a smaller side chain (e.g., Ala, Ser) or no side chain (e.g., Gly) is
substituted for, or by, one having a bulky side chain (e.g., Phe or Trp).
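
The distinction drawn above between the exemplary substitutions of TABLE B and other replacements can be expressed as a simple lookup. The following Python sketch is illustrative only: the dictionary is a transcription of TABLE B using three-letter residue codes, and the function name is hypothetical. A result of False means only that the replacement is not listed in TABLE B, not that it is necessarily deleterious.

```python
# Transcription of TABLE B: original residue -> exemplary substitutions.
TABLE_B = {
    "Ala": {"Ser"},
    "Arg": {"Lys"},
    "Asn": {"Gln", "His"},
    "Asp": {"Glu"},
    "Cys": {"Ser"},
    "Gln": {"Asn"},
    "Glu": {"Asp"},
    "Gly": {"Pro"},
    "His": {"Asn", "Gln"},
    "Ile": {"Leu", "Val"},
    "Leu": {"Ile", "Val"},
    "Lys": {"Arg", "Gln", "Glu"},
    "Met": {"Leu", "Ile"},
    "Phe": {"Met", "Leu", "Tyr"},
    "Ser": {"Thr"},
    "Thr": {"Ser"},
    "Trp": {"Tyr"},
    "Tyr": {"Trp", "Phe"},
    "Val": {"Ile", "Leu"},
}


def is_listed_substitution(original: str, replacement: str) -> bool:
    """True if replacing `original` by `replacement` is listed in TABLE B."""
    return replacement in TABLE_B.get(original, set())


if __name__ == "__main__":
    for original, replacement in [("Lys", "Arg"), ("Gly", "Trp")]:
        listed = is_listed_substitution(original, replacement)
        label = "listed in TABLE B" if listed else "not listed in TABLE B"
        print(f"{original} -> {replacement}: {label}")
```
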

2.4 Polypeptide derivatives 

A polypeptide can typically tolerate one or more amino acid deletions and
insertions in its amino acid sequence without loss or significant loss of a desired activity.
Accordingly, the invention also contemplates derivatives of the parent polypeptides of the
invention described in Section 2.1 or biologically active fragments thereof or variants of
these, which include amino acid deletions and/or additions, wherein said derivatives
comprise one or more activities selected from the group consisting of acyl-CoA ligase
activity, β-ketoacyl synthase activity, β-ketoacyl reductase activity, acyl carrier protein
activity, adenylation activity, peptidyl carrier protein activity, condensation activity,
PPTase activity and methyltransferase activity associated with antibiotic biosynthesis, and
preferably with albicidin biosynthesis.

Preferred derivatives of the invention include PKS-NRPS molecules with altered
activities in one or more respects, which thus produce polyketides other than the albicidin
natural product(s) of XabB. A PKS-NRPS derived from XabB by such alteration
includes a modular PKS-NRPS (or its corresponding encoding gene(s)) that retains the
scaffolding of the utilised portion encoded by the naturally occurring gene. Not all domains
or modules need be altered. On the constant scaffold, at least one enzymatic activity is
mutated, deleted, replaced, or inserted so as to alter the activity of the resulting PKS-NRPS
relative to the original or parent PKS-NRPS. Alteration results when these activities are
deleted or are replaced by a different version of the activity, or simply mutated in such a
way that a polyketide other than the natural product results from these collective activities.
This occurs because there has been a resulting alteration of the starter unit and/or
elongation unit, stereochemistry, chain length or cyclisation, and/or reductive or
dehydration cycle outcome at a corresponding position in the product polyketide. Where a
deleted activity is replaced, the replacement activity may come from a
corresponding activity in a different naturally occurring PKS or PKS-NRPS or from a
different region of the albicidin PKS-NRPS. Any or all PKS/NRPS genes may be included
in the derivative or portions of any of these may be included, but the scaffolding of the
albicidin PKS-NRPS protein is preferably retained in whatever derivative is constructed.

Thus, a PKS-NRPS derived from the albicidin PKS-NRPS includes a PKS-NRPS
that contains the scaffolding of all or a portion of XabB. The derived PKS-NRPS also
contains at least two elongation modules that are functional and preferably at least three
elongation modules. The derived PKS-NRPS also contains mutations, deletions, insertions,
or replacements of one or more of the activities of the functional domains or modules of
XabB so that the nature of the resulting polyketide is altered. Exemplary embodiments
include those wherein a KS or ACP domain has been deleted or replaced by a version of
the activity from a different PKS/NRPS or from another location within XabB. Also
contemplated are derivatives where at least one non-condensation cycle enzymatic activity
(KR, KR, or A) has been deleted or added or wherein any of these activities has been 
mutated so as to change the structure of the polyketide synthesised by the PKS. 

Other derivatives contemplated by the present invention include fusion of the
polypeptides, fragments and polypeptide variants of the invention with other polypeptides
or proteins. For example, it will be appreciated that said polypeptides, fragments or
variants may be incorporated into larger polypeptides, and that such larger polypeptides
may also be expected to have one or more of the activities mentioned above. The
polypeptides, fragments or variants of the invention may be fused to a further protein, for
example, which is not derived from the original host. The further protein may assist in the
purification of the fusion protein. For instance, a polyhistidine tag or a maltose binding
protein may be used in this respect as described in more detail below. Other possible fusion
proteins are those which produce an immunomodulatory response. Particular examples of
such proteins include Protein A or glutathione S-transferase (GST).

Other derivatives contemplated by the invention include, but are not limited to,
modification to side chains, incorporation of unnatural amino acids and/or their derivatives
during peptide, polypeptide or protein synthesis and the use of crosslinkers and other
methods which impose conformational constraints on the polypeptides, fragments and
variants of the invention. Examples of side chain modifications contemplated by the
present invention include modifications of amino groups such as by acylation with acetic
anhydride; acylation of amino groups with succinic anhydride and tetrahydrophthalic
anhydride; amidination with methylacetimidate; carbamoylation of amino groups with
cyanate; pyridoxylation of lysine with pyridoxal-5-phosphate followed by reduction with
NaBH4; reductive alkylation by reaction with an aldehyde followed by reduction with
NaBH4; and trinitrobenzylation of amino groups with 2,4,6-trinitrobenzene sulphonic acid
(TNBS). The carboxyl group may be modified by carbodiimide activation via
O-acylisourea formation followed by subsequent derivatisation, by way of example, to a
corresponding amide. The guanidine group of arginine residues may be modified by
formation of heterocyclic condensation products with reagents such as 2,3-butanedione,
phenylglyoxal and glyoxal. Sulphydryl groups may be modified by methods such as
performic acid oxidation to cysteic acid; formation of mercurial derivatives using
4-chloromercuriphenylsulphonic acid, 4-chloromercuribenzoate,
2-chloromercuri-4-nitrophenol, phenylmercury chloride, and other mercurials; formation of
mixed disulphides with other thiol compounds; reaction with maleimide, maleic anhydride or
other substituted maleimide; carboxymethylation with iodoacetic acid or iodoacetamide;
and carbamoylation with cyanate at alkaline pH. Tryptophan residues may be modified, for
example, by alkylation of the indole ring with 2-hydroxy-5-nitrobenzyl bromide or
sulphonyl halides or by oxidation with N-bromosuccinimide. Tyrosine residues may be
modified by nitration with tetranitromethane to form a 3-nitrotyrosine derivative. The
imidazole ring of a histidine residue may be modified by N-carbethoxylation with
diethylpyrocarbonate or by alkylation with iodoacetic acid derivatives.

Examples of incorporating unnatural amino acids and derivatives during peptide
synthesis include but are not limited to, use of 4-amino butyric acid, 6-aminohexanoic acid,
4-amino-3-hydroxy-5-phenylpentanoic acid, 4-amino-3-hydroxy-6-methylheptanoic acid,
t-butylglycine, norleucine, norvaline, phenylglycine, ornithine, sarcosine, 2-thienyl alanine
and/or D-isomers of amino acids. A list of unnatural amino acids contemplated by the
present invention is shown in TABLE C.



TABLE C

Non-conventional amino acid                         Non-conventional amino acid

α-aminobutyric acid                                 L-N-methylalanine
α-amino-α-methylbutyrate                            L-N-methylarginine
aminocyclopropane-carboxylate                       L-N-methylasparagine
aminoisobutyric acid                                L-N-methylaspartic acid
aminonorbornyl-carboxylate                          L-N-methylcysteine
cyclohexylalanine                                   L-N-methylglutamine
cyclopentylalanine                                  L-N-methylglutamic acid
L-N-methylisoleucine                                L-N-methylhistidine
D-alanine                                           L-N-methylleucine
D-arginine                                          L-N-methyllysine
D-aspartic acid                                     L-N-methylmethionine
D-cysteine                                          L-N-methylnorleucine
D-glutamate                                         L-N-methylnorvaline
D-glutamic acid                                     L-N-methylornithine
D-histidine                                         L-N-methylphenylalanine
D-isoleucine                                        L-N-methylproline
D-leucine                                           L-N-methylserine
D-lysine                                            L-N-methylthreonine
D-methionine                                        L-N-methyltryptophan
D-ornithine                                         L-N-methyltyrosine
D-phenylalanine                                     L-N-methylvaline
D-proline                                           L-N-methylethylglycine
D-serine                                            L-N-methyl-t-butylglycine
D-threonine                                         L-norleucine
D-tryptophan                                        L-norvaline
D-tyrosine                                          α-methyl-aminoisobutyrate
D-valine                                            α-methyl-γ-aminobutyrate
D-α-methylalanine                                   α-methylcyclohexylalanine
D-α-methylarginine                                  α-methylcyclopentylalanine
D-α-methylasparagine                                α-methyl-α-naphthylalanine
D-α-methylaspartate                                 α-methylpenicillamine
D-α-methylcysteine                                  N-(4-aminobutyl)glycine
D-α-methylglutamine                                 N-(2-aminoethyl)glycine
D-α-methylhistidine                                 N-(3-aminopropyl)glycine
D-α-methylisoleucine                                N-amino-α-methylbutyrate
D-α-methylleucine                                   α-naphthylalanine
D-α-methyllysine                                    N-benzylglycine
D-α-methylmethionine                                N-(2-carbamylethyl)glycine
D-α-methylornithine                                 N-(carbamylmethyl)glycine
D-α-methylphenylalanine                             N-(2-carboxyethyl)glycine
D-α-methylproline                                   N-(carboxymethyl)glycine
D-α-methylserine                                    N-cyclobutylglycine
D-α-methylthreonine                                 N-cycloheptylglycine
D-α-methyltryptophan                                N-cyclohexylglycine
D-α-methyltyrosine                                  N-cyclodecylglycine
L-α-methylleucine                                   L-α-methyllysine
L-α-methylmethionine                                L-α-methylnorleucine
L-α-methylnorvaline                                 L-α-methylornithine
L-α-methylphenylalanine                             L-α-methylproline
L-α-methylserine                                    L-α-methylthreonine
L-α-methyltryptophan                                L-α-methyltyrosine
L-α-methylvaline                                    L-N-methylhomophenylalanine
N-(N-(2,2-diphenylethyl)carbamylmethyl)glycine      N-(N-(3,3-diphenylpropyl)carbamylmethyl)glycine
1-carboxy-1-(2,2-diphenyl-ethylamino)cyclopropane



Also contemplated is the use of crosslinkers, for example, to stabilise 3D
conformations of the polypeptides, fragments or variants of the invention, using
homo-bifunctional crosslinkers such as bifunctional imido esters having (CH2)n spacer
groups with n = 1 to n = 6, glutaraldehyde, N-hydroxysuccinimide esters and
hetero-bifunctional reagents which usually contain an amino-reactive moiety such as
N-hydroxysuccinimide and another group specific-reactive moiety such as maleimido or
dithio moiety or carbodiimide. In addition, peptides can be conformationally constrained,
for example, by introduction of double bonds between Cα and Cβ atoms of amino acids, by
incorporation of Cα- and Nα-methylamino acids, and by formation of cyclic peptides or
analogues by introducing covalent bonds such as forming an amide bond between the N and
C termini, between two side chains or between a side chain and the N or C terminus of the
peptides or analogues. For example, reference may be made to: Marlowe (1993, Bioorganic &
Medicinal Chemistry Letters 3: 437-44) who describes peptide cyclisation on TFA resin
using trimethylsilyl (TMSE) ester as an orthogonal protecting group; Pallin and Tam
(1995, J. Chem. Soc. Chem. Comm. 2021-2022) who describe the cyclisation of
unprotected peptides in aqueous solution by oxime formation; Alsina et al. (1994,
Tetrahedron Letters 35: 9633-9636) who disclose solid-phase synthesis of head-to-tail
cyclic peptides via lysine side-chain anchoring; Kates et al. (1993, Tetrahedron Letters 34:
1549-1552) who describe the production of head-to-tail cyclic peptides by
three-dimensional solid phase strategy; Tumelty et al. (1994, J. Chem. Soc. Chem. Comm.
1067-1068) who describe the synthesis of cyclic peptides from an immobilised activated
intermediate, wherein activation of the immobilised peptide is carried out with the
N-protecting group intact and subsequent removal leads to cyclisation; McMurray et al.
(1994, Peptide Research 7: 195-206) who disclose head-to-tail cyclisation of peptides
attached to insoluble supports by means of the side chains of aspartic and glutamic acid;
Hruby et al. (1994, Reactive Polymers 22: 231-241) who teach an alternate method for
cyclising peptides via solid supports; and Schmidt and Langer (1997, J. Peptide Res. 49:
67-73) who disclose a method for synthesising cyclotetrapeptides and cyclopentapeptides.
The foregoing methods may be used to produce conformationally constrained polypeptides
that comprise one or more activities selected from the group consisting of acyl-CoA ligase
activity, β-ketoacyl synthase activity, β-ketoacyl reductase activity, acyl carrier protein
activity, adenylation activity, peptidyl carrier protein activity, condensation activity,
PPTase activity and methyltransferase activity associated with the production of
polyketides and particularly albicidins or analogues thereof.

The invention also contemplates polypeptides, fragments or variants of the
invention that have been modified using ordinary molecular biological techniques so as to
improve their resistance to proteolytic degradation or to optimise solubility properties or to
render them more suitable as an immunogenic agent.

3. Polynucleotides of the invention

3.1 Polynucleotides encoding polypeptides of the invention 

3.1.1 Albicidin synthetase-encoding polynucleotides 

The invention further provides a polynucleotide that encodes a PKS-NRPS
polypeptide of the invention, or biologically active fragment thereof, or variant or
derivative of these as defined above. In one embodiment, the polynucleotide comprises the
entire sequence of nucleotides set forth in SEQ ID NO: 1. SEQ ID NO: 1 corresponds to a
16511-bp X. albilineans xabB cistron. SEQ ID NO: 3 defines the full-length coding
sequence of xabB and encodes various sequence signature motifs at the following
nucleotide positions:

(a) acyl-CoA ligase (AL) motif I from about nucleotide 676 to about nucleotide 720,
and motif II from about nucleotide 1456 to about nucleotide 1477;

(b) β-ketoacyl synthase 1 (KS1) motif I from about nucleotide 2689 to about
nucleotide 2739, motif II from about nucleotide 3112 to about nucleotide 3141, and
motif III from about nucleotide 3238 to about nucleotide 3267;

(c) β-ketoacyl synthase 2 (KS2) motif I from about nucleotide 8329 to about
nucleotide 8379, motif II from about nucleotide 8752 to about nucleotide 8781, and
motif III from about nucleotide 8863 to about nucleotide 8892;

(d) β-ketoacyl reductase (KR) motif from about nucleotide 5434 to about nucleotide
5526;

(e) acyl carrier protein 1 (ACP1) motif from about nucleotide 1999 to about
nucleotide 2034;

(f) acyl carrier protein 2 (ACP2) motif from about nucleotide 7450 to about
nucleotide 7485;

(g) acyl carrier protein 3 (ACP3) motif from about nucleotide 7702 to about
nucleotide 7735;

(h) adenylation domain (A) motif I from about nucleotide 11416 to about nucleotide
11433, motif II from about nucleotide 11551 to about nucleotide 11583, motif III from
about nucleotide 11749 to about nucleotide 11796, motif IV from about nucleotide
11899 to about nucleotide 11910, motif V from about nucleotide 12187 to about
nucleotide 12207, motif VI from about nucleotide 12340 to about nucleotide 12384,
motif VII from about nucleotide 12454 to about nucleotide 12471, motif VIII from
about nucleotide 12508 to about nucleotide 12567, motif IX from about nucleotide
12715 to about nucleotide 12735, and motif X from about nucleotide 12775 to about
nucleotide 12792;

(i) peptidyl carrier protein 1 (PCP1) motif from about nucleotide 9781 to about
nucleotide 9813;

(j) peptidyl carrier protein 2 (PCP2) motif from about nucleotide 129U5 to about 
nucleotide 12948; 

(k) condensation domain 1 (C1) motif I from about nucleotide 9997 to about
nucleotide 10026, motif II from about nucleotide 10141 to about nucleotide 10167,
motif III from about nucleotide 10366 to about nucleotide 10395, motif IV from about
nucleotide 10483 to about nucleotide 10503, motif V from about nucleotide 10816 to
about nucleotide 10851, motif VI from about nucleotide 10921 to about nucleotide
10941, and motif VII from about nucleotide 10972 to about nucleotide 10995; and
(l) condensation domain 2 (C2) motif I from about nucleotide 13120 to about
nucleotide 13149, motif II from about nucleotide 13261 to about nucleotide 13287,
motif III from about nucleotide 13492 to about nucleotide 13521, motif IV from about
nucleotide 13612 to about nucleotide 13632, motif V from about nucleotide 13945 to
about nucleotide 13977, motif VI from about nucleotide 14053 to about nucleotide
14073, and motif VII from about nucleotide 14101 to about nucleotide 14124.

Those of skill in the art will recognise that, due to the degenerate nature of the 
genetic code, a variety of polynucleotides differing in their nucleotide sequences can be 
used to encode a given amino acid sequence of the invention. The native polynucleotide 
sequence encoding the PKS-NRPS of X. albilineans is shown herein merely to illustrate a 
preferred embodiment of the invention, and the invention includes polynucleotides of any
sequence that encode the amino acid sequences of the polypeptides and proteins of the 
invention. 
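
By way of illustration only, the degeneracy of the genetic code referred to above may be sketched in Python as follows. The sketch assumes the standard genetic code; the function names translate and count_encodings, and the short example sequences used with them, are hypothetical and are not taken from the sequence listing.

```python
from itertools import product
from math import prod

# Standard genetic code, codons ordered with bases T, C, A, G (assumed table).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a coding sequence codon by codon, ignoring a trailing partial codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def count_encodings(peptide):
    """Count the distinct nucleotide sequences that encode the given peptide."""
    codons_per_residue = {}
    for aa in CODON_TABLE.values():
        codons_per_residue[aa] = codons_per_residue.get(aa, 0) + 1
    return prod(codons_per_residue[aa] for aa in peptide.upper())
```

For example, the hypothetical sequences ATGAAATTT and ATGAAGTTC differ at two positions yet both translate to the tripeptide MKF, and count_encodings("MKF") reports four possible coding sequences for that tripeptide.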



3.1.2 PPTase-encoding polynucleotides 

The invention further provides a polynucleotide that encodes a PPTase
polypeptide of the invention, or biologically active fragment thereof, or variant or
derivative of these as defined above. In one embodiment, the polynucleotide comprises the
entire sequence of nucleotides set forth in SEQ ID NO: 82. SEQ ID NO: 82 corresponds to
a 1200-bp X. albilineans xabA cistron. This sequence encodes a PPTase catalytic domain
from about nucleotide 475 to about nucleotide 654. This domain comprises two conserved
PPTase sequence motifs: (I) motif I encoded by a nucleotide sequence from about
nucleotide 475 to about nucleotide 501; and (II) motif II encoded by a nucleotide sequence
from about nucleotide 619 to about nucleotide 654, of SEQ ID NO: 82. The intervening
amino acid sequence, linking motifs I and II, is encoded by a nucleotide sequence from
about nucleotide 502 to about nucleotide 618 of SEQ ID NO: 82. The said nucleotide
sequences are presented for convenience in SEQ ID NO: 86, 88, 92 and 90, respectively.
Suitably, the polynucleotide comprises the sequence set forth in SEQ ID NO: 84, which
defines the full-length coding sequence of xabA. Alternatively, the polynucleotide
comprises a contiguous sequence of nucleotides contained within the sequence set forth in
SEQ ID NO: 86, which encodes the PPTase catalytic domain.
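
The nucleotide coordinates recited above are 1-based and inclusive. As a minimal sketch only, they may be handled in Python as shown below. The coordinates are those stated for SEQ ID NO: 82; the names PPTASE_REGIONS, region_length, codon_count and extract_region are hypothetical, and any sequence passed to extract_region in the absence of the sequence listing is a placeholder rather than the actual xabA cistron.

```python
# 1-based inclusive coordinates as recited in the text for SEQ ID NO: 82.
PPTASE_REGIONS = {
    "catalytic domain": (475, 654),
    "motif I": (475, 501),
    "linker": (502, 618),
    "motif II": (619, 654),
}


def region_length(start, end):
    """Length in nucleotides of a 1-based inclusive region."""
    return end - start + 1


def codon_count(start, end):
    """Number of whole codons in a region; the region must be a multiple of three."""
    length = region_length(start, end)
    if length % 3:
        raise ValueError("region does not span a whole number of codons")
    return length // 3


def extract_region(sequence, start, end):
    """Return the subsequence for a 1-based inclusive region."""
    if not 1 <= start <= end <= len(sequence):
        raise ValueError("coordinates fall outside the sequence")
    return sequence[start - 1:end]
```

On these coordinates, motif I, the linker and motif II span 9, 39 and 12 codons respectively, which together account for the 180 nucleotides of the catalytic domain.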

3.1.3 Methyltransferase-encoding polynucleotides 

The invention further provides a polynucleotide that encodes a methyltransferase
polypeptide of the invention, or biologically active fragment thereof, or variant or
derivative of these as defined above. In one embodiment, the polynucleotide comprises the
entire sequence of nucleotides set forth in SEQ ID NO: 94. SEQ ID NO: 94 corresponds to
a 1515-bp X. albilineans xabC cistron. This sequence encodes three conserved
methyltransferase sequence motifs: (I) motif I encoded by a nucleotide sequence from
about nucleotide 565 to about nucleotide 585; (II) motif II encoded by a nucleotide
sequence from about nucleotide 741 to about nucleotide 774; and (III) motif III encoded by
a nucleotide sequence from about nucleotide 841 to about nucleotide 867, of SEQ ID NO:
94. The said nucleotide sequences are presented for convenience in SEQ ID NO: 98, 100
and 102, respectively. Suitably, the polynucleotide comprises the sequence set forth in
SEQ ID NO: 96, which defines the full-length coding sequence of xabC. Alternatively, the
polynucleotide comprises a contiguous sequence of nucleotides contained within the
sequence set forth in SEQ ID NO: 104 or 106, which encode biologically active fragments
as described in Section 2.2.

3.2 Polynucleotide variants

In general, polynucleotide variants according to the invention comprise regions
that show at least 60%, more suitably at least 70%, preferably at least 80%, and more
preferably at least 90% sequence identity over a reference polynucleotide sequence of
identical size ("comparison window") or when compared to an aligned sequence in which
the alignment is performed by a computer homology program known in the art. What
constitutes suitable variants may be determined by conventional techniques. For example,
a polynucleotide comprising at least one sequence selected from the group consisting of
SEQ ID NO: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43,
45, 47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 82, 84, 86, 88, 90, 92,
94, 96, 98, 100, 102 and 104 can be altered using any suitable method including
conventional recombinant techniques and mutagenesis methods such as random
mutagenesis (e.g., transposon mutagenesis), oligonucleotide-mediated (or site-directed)
mutagenesis, PCR mutagenesis and cassette mutagenesis of an earlier prepared variant or
non-variant version of an isolated polynucleotide of the invention.
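
As an illustrative sketch only, percent sequence identity over a comparison window of identical size may be computed in Python as follows. The sketch assumes that the two sequences have already been aligned to equal length, with any gaps written as "-" and counted as mismatches; the names percent_identity, highest_threshold_met and THRESHOLDS are hypothetical, and the sketch is not a substitute for a computer homology program known in the art.

```python
# Identity thresholds (percent) recited in the text.
THRESHOLDS = (60.0, 70.0, 80.0, 90.0)


def percent_identity(reference, candidate):
    """Percent identity between two pre-aligned sequences of equal length."""
    if not reference or len(reference) != len(candidate):
        raise ValueError("sequences must be non-empty and of identical size")
    pairs = zip(reference.upper(), candidate.upper())
    # A gap character never counts as a match, even when both sequences have one.
    matches = sum(1 for r, c in pairs if r == c and r != "-")
    return 100.0 * matches / len(reference)


def highest_threshold_met(identity):
    """Return the highest recited threshold satisfied, or None if below 60%."""
    met = [t for t in THRESHOLDS if identity >= t]
    return max(met) if met else None
```

For instance, two hypothetical ten-nucleotide sequences that agree at eight positions give 80% identity, which satisfies the "at least 80%" level but not the "at least 90%" level.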

Alternatively, polynucleotide sequence variants encoding heterologous
PKS/NRPS enzymes for producing PKS-NRPS variants of the invention may be obtained
from other secondary metabolite- or polyketide-producing organisms. For example, such
variants may be prepared according to the following procedure:

(a) creating primers which are optionally degenerate wherein each comprises a
portion of a reference polynucleotide encoding a reference polypeptide or fragment of
the invention, preferably encoding at least one sequence selected from the group
consisting of SEQ ID NO: 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36,
38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 83,
87, 89, 91, 93, 95, 99, 101, 103, 105 and 107;

(b) obtaining a nucleic acid extract from a secondary metabolite-producing
organism, which is preferably a bacterium, more preferably from a species of the family
Pseudomonadaceae, more preferably from a Xanthomonas species; and

(c) using said primers to amplify, via nucleic acid amplification techniques, at
least one amplification product from said nucleic acid extract, wherein said
amplification product corresponds to a polynucleotide variant.

Suitable nucleic acid amplification techniques are well known to the skilled
addressee, and include polymerase chain reaction (PCR) as for example described in
Ausubel et al. (supra); strand displacement amplification (SDA) as for example described
in U.S. Patent No 5,422,252; rolling circle replication (RCR) as for example described in
Liu et al., (1996, J. Am. Chem. Soc. 118:1587-1594 and International application WO
92/01813) and Lizardi et al., (International Application WO 97/19193); nucleic acid
sequence-based amplification (NASBA) as for example described by Sooknanan et al.,
(1994, Biotechniques 17:1077-1080); and Q-β replicase amplification as for example
described by Tyagi et al., (1996, Proc. Natl. Acad. Sci. USA 93: 5395-5400).
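
Step (a) of the procedure above refers to primers that are optionally degenerate. A minimal sketch of one way such a primer might be derived from a short peptide is given below in Python. It assumes the standard genetic code and the IUPAC nucleotide ambiguity codes; the function names degenerate_codon and degenerate_primer and the example peptide are hypothetical and do not correspond to any sequence of the sequence listing.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G (assumed table).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# IUPAC ambiguity code for each possible set of bases at one position.
IUPAC = {
    frozenset("A"): "A", frozenset("C"): "C", frozenset("G"): "G",
    frozenset("T"): "T", frozenset("AG"): "R", frozenset("CT"): "Y",
    frozenset("CG"): "S", frozenset("AT"): "W", frozenset("GT"): "K",
    frozenset("AC"): "M", frozenset("CGT"): "B", frozenset("AGT"): "D",
    frozenset("ACT"): "H", frozenset("ACG"): "V", frozenset("ACGT"): "N",
}


def degenerate_codon(amino_acid):
    """Collapse all codons for one residue into a single IUPAC-coded triplet.

    Collapsing position by position can over-generate for six-codon residues
    (for example, leucine gives YTN, which also covers phenylalanine codons).
    """
    codons = [c for c, aa in CODON_TABLE.items() if aa == amino_acid]
    if not codons:
        raise ValueError(f"unknown residue: {amino_acid!r}")
    return "".join(IUPAC[frozenset(c[i] for c in codons)] for i in range(3))


def degenerate_primer(peptide):
    """Back-translate a peptide into a degenerate sense-strand primer."""
    return "".join(degenerate_codon(aa) for aa in peptide.upper())
```

For the hypothetical tripeptide MKF this yields ATGAARTTY, that is, a pool of four primer sequences. In practice, regions rich in methionine and tryptophan are favoured because they minimise the degeneracy of the pool.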

Typically, polynucleotide variants that are substantially complementary to a
reference polynucleotide are identified by blotting techniques that include a step whereby
nucleic acids are immobilised on a matrix (preferably a synthetic membrane such as
nitrocellulose), followed by a hybridisation step, and a detection step. Southern blotting is
used to identify a complementary DNA sequence; northern blotting is used to identify a
complementary RNA sequence. Dot blotting and slot blotting can be used to identify
complementary DNA/DNA, DNA/RNA or RNA/RNA polynucleotide sequences. Such
techniques are well known by those skilled in the art, and have been described in Ausubel
et al. (1994-1998, supra) at pages 2.9.1 through 2.9.20.

According to such methods, Southern blotting involves separating DNA
molecules according to size by gel electrophoresis, transferring the size-separated DNA to
a synthetic membrane, and hybridising the membrane-bound DNA to a complementary
nucleotide sequence labelled radioactively, enzymatically or fluorochromatically. In dot
blotting and slot blotting, DNA samples are directly applied to a synthetic membrane prior
to hybridisation as above. An alternative blotting step is used when identifying
complementary polynucleotides in a cDNA or genomic DNA library, such as through the
process of plaque or colony hybridisation. A typical example of this procedure is described
in Sambrook et al. ("Molecular Cloning. A Laboratory Manual", Cold Spring Harbor
Press, 1989) Chapters 8-12.

Typically, the following general procedure can be used to determine hybridisation
conditions. Polynucleotides are blotted/transferred to a synthetic membrane, as described
above. A reference polynucleotide such as a polynucleotide of the invention is labelled as
described above, and the ability of this labelled polynucleotide to hybridise with an
immobilised polynucleotide is analysed. A skilled addressee will recognise that a number
of factors influence hybridisation. The specific activity of radioactively labelled
polynucleotide sequence should typically be greater than or equal to about 10^8 dpm/mg to
provide a detectable signal. A radiolabeled nucleotide sequence of specific activity 10^8 to
10^9 dpm/mg can detect approximately 0.5 pg of DNA. It is well known in the art that
sufficient DNA must be immobilised on the membrane to permit detection. It is desirable
to have excess immobilised DNA, usually 10 μg. Adding an inert polymer such as 10%
(w/v) dextran sulfate (MW 500,000) or polyethylene glycol 6000 during hybridisation can
also increase the sensitivity of hybridisation (see Ausubel supra at 2.10.10).

To achieve meaningful results from hybridisation between a polynucleotide
immobilised on a membrane and a labelled polynucleotide, a sufficient amount of the
labelled polynucleotide must be hybridised to the immobilised polynucleotide following
washing. Washing ensures that the labelled polynucleotide is hybridised only to the
immobilised polynucleotide with a desired degree of complementarity to the labelled
polynucleotide. It will be understood that polynucleotide variants according to the
invention will hybridise to a reference polynucleotide under at least low stringency
conditions. Reference herein to low stringency conditions includes and encompasses from at
least about 1% v/v to at least about 15% v/v formamide and from at least about 1 M to at
least about 2 M salt for hybridisation at 42° C, and at least about 1 M to at least about 2 M
salt for washing at 42° C. Low stringency conditions also may include 1% Bovine Serum
Albumin (BSA), 1 mM EDTA, 0.5 M NaHPO4 (pH 7.2), 7% SDS for hybridisation at
65° C, and (i) 2xSSC, 0.1% SDS; or (ii) 0.5% BSA, 1 mM EDTA, 40 mM NaHPO4 (pH
7.2), 5% SDS for washing at room temperature.

Suitably, the polynucleotide variants hybridise to a reference polynucleotide under
at least medium stringency conditions. Medium stringency conditions include and
encompass from at least about 16% v/v to at least about 30% v/v formamide and from at
least about 0.5 M to at least about 0.9 M salt for hybridisation at 42° C, and at least about
0.1 M to at least about 0.2 M salt for washing at 55° C. Medium stringency conditions also
may include 1% Bovine Serum Albumin (BSA), 1 mM EDTA, 0.5 M NaHPO4 (pH 7.2),
7% SDS for hybridisation at 65° C, and (i) 2xSSC, 0.1% SDS; or (ii) 0.5% BSA, 1 mM
EDTA, 40 mM NaHPO4 (pH 7.2), 5% SDS for washing at 60-65° C.

Preferably, the polynucleotide variants hybridise to a reference polynucleotide
under high stringency conditions. High stringency conditions include and encompass from
at least about 31% v/v to at least about 50% v/v formamide and from about 0.01 M to
about 0.15 M salt for hybridisation at 42° C, and about 0.01 M to about 0.02 M salt for
washing at 55° C. High stringency conditions also may include 1% BSA, 1 mM EDTA, 0.5
M NaHPO4 (pH 7.2), 7% SDS for hybridisation at 65° C, and (i) 0.2xSSC, 0.1% SDS; or
(ii) 0.5% BSA, 1 mM EDTA, 40 mM NaHPO4 (pH 7.2), 1% SDS for washing at a
temperature in excess of 65° C.
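
Purely for convenience, the formamide ranges recited above for hybridisation at 42° C may be tabulated as in the following Python sketch. The ranges are those stated in the text; the names STRINGENCY_FORMAMIDE and classify_stringency are hypothetical, and the sketch deliberately ignores the salt concentration and wash conditions, which would also need to fall within the recited ranges.

```python
# Formamide ranges (% v/v) recited for hybridisation at 42 degrees C.
STRINGENCY_FORMAMIDE = {
    "low": (1.0, 15.0),
    "medium": (16.0, 30.0),
    "high": (31.0, 50.0),
}


def classify_stringency(formamide_percent):
    """Return the recited stringency class for a formamide concentration.

    Values outside every recited range (including values between two
    ranges, such as 15.5) return None rather than being rounded.
    """
    for name, (lower, upper) in STRINGENCY_FORMAMIDE.items():
        if lower <= formamide_percent <= upper:
            return name
    return None
```

For example, 50% v/v formamide, as used in the preferred hybridisation buffer, falls within the range recited for high stringency.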

Other stringent conditions are well known in the art. A skilled addressee will 
recognise that various factors can be manipulated to optimise the specificity of the
hybridisation. Optimisation of the stringency of the final washes can serve to ensure a high 
degree of hybridisation. For detailed examples, see Ausubel et al, supra at pages 2.10.1 to 
2.10.16 and Sambrook et al (1989, supra) at sections 1.101 to 1.104. 

While stringent washes are typically carried out at temperatures from about 42° C
to 68° C, one skilled in the art will appreciate that other temperatures may be suitable for
stringent conditions. Maximum hybridisation rate typically occurs at about 20° C to 25° C
below the Tm for formation of a DNA-DNA hybrid. It is well known in the art that the Tm
is the melting temperature, or temperature at which two complementary polynucleotide
sequences dissociate. Methods for estimating Tm are well known in the art (see Ausubel et
al., supra at page 2.10.8).

In general, the Tm of a perfectly matched duplex of DNA may be predicted as an
approximation by the formula:

Tm = 81.5 + 16.6 (log10 M) + 0.41 (%G+C) - 0.63 (% formamide) - (600/length)

wherein: M is the concentration of Na+, preferably in the range of 0.01 molar to
0.4 molar; %G+C is the sum of guanosine and cytosine bases as a percentage of the total
number of bases, within the range between 30% and 75% G+C; % formamide is the
percent formamide concentration by volume; length is the number of base pairs in the
DNA duplex.

The Tm of a duplex DNA decreases by approximately 1° C with every increase of
1% in the number of randomly mismatched base pairs. Washing is generally carried out at
Tm - 15° C for high stringency, or Tm - 30° C for moderate stringency.
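
The above approximation, together with the mismatch correction and the wash temperatures just described, may be sketched in Python as follows. This is an illustrative calculation only: the function and parameter names are hypothetical, and the sketch does not check that the inputs fall within the preferred ranges stated above (0.01 to 0.4 molar Na+ and 30% to 75% G+C).

```python
import math


def tm_dna_duplex(na_molar, gc_percent, formamide_percent, length_bp):
    """Approximate Tm (degrees C) of a perfectly matched DNA duplex.

    Implements Tm = 81.5 + 16.6 (log10 M) + 0.41 (%G+C)
                    - 0.63 (% formamide) - (600/length).
    """
    return (81.5
            + 16.6 * math.log10(na_molar)
            + 0.41 * gc_percent
            - 0.63 * formamide_percent
            - 600.0 / length_bp)


def adjust_for_mismatch(tm, mismatch_percent):
    """Lower Tm by about 1 degree C per 1% of randomly mismatched base pairs."""
    return tm - mismatch_percent


def wash_temperature(tm, stringency):
    """Wash temperature: Tm - 15 for high stringency, Tm - 30 for moderate."""
    offsets = {"high": 15.0, "moderate": 30.0}
    return tm - offsets[stringency]
```

As a worked example under assumed values, a 600-bp duplex of 50% G+C in 0.1 molar Na+ without formamide gives 81.5 - 16.6 + 20.5 - 0 - 1 = 84.4° C, corresponding to a high stringency wash at about 69.4° C.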

In a preferred hybridisation procedure, a membrane (e.g., a nitrocellulose
membrane or a nylon membrane) containing immobilised DNA is hybridised overnight at
42° C in a hybridisation buffer (50% deionised formamide, 5xSSC, 5x Denhardt's solution
(0.1% ficoll, 0.1% polyvinylpyrrolidone and 0.1% bovine serum albumin), 0.1% SDS and
200 mg/mL denatured salmon sperm DNA) containing labelled probe. The membrane is
then subjected to two sequential medium stringency washes (i.e., 2xSSC, 0.1% SDS for 15
min at 45° C, followed by 2xSSC, 0.1% SDS for 15 min at 50° C), followed by two
sequential higher stringency washes (i.e., 0.2xSSC, 0.1% SDS for 12 min at 55° C
followed by 0.2xSSC and 0.1% SDS solution for 12 min at 65-68° C).

Methods for detecting a labelled polynucleotide hybridised to an immobilised 
polynucleotide are well known to practitioners in the art. Such methods include 
autoradiography, phosphorimaging, and chemiluminescent, fluorescent and colorimetric
detection. 

4. Expression vectors 

The present invention further provides expression vectors designed for genetic
transformation of cells, preferably prokaryotic cells, comprising a polynucleotide, fragment
or variant according to the invention operably linked to a regulatory polynucleotide. An
expression vector is typically a nucleic acid that can be introduced into a host cell or
cell-free transcription and translation system. An expression vector can be maintained
permanently or transiently in a cell, whether as part of the chromosomal or other DNA in
the cell or in any cellular compartment, such as a replicating vector in the cytoplasm.

The various components of an expression vector can vary widely, depending on
the intended use of the vector and especially the host cell(s) in which the vector is intended
to replicate or drive expression. For example, the regulatory polynucleotide, which is used
to control expression of a polynucleotide of the invention, will generally be appropriate for
the host cell used for expression. Numerous types of appropriate expression vectors and
suitable regulatory sequences are known in the art for a variety of host cells. Typically, the
regulatory polynucleotide includes, but is not limited to, promoter sequences, leader or
signal sequences, ribosomal binding sites, transcriptional start and stop sequences,
translational start and termination sequences, and enhancer or activator sequences.
Constitutive or inducible promoters as known in the art are contemplated by the invention.
The promoters may be either naturally occurring promoters, or hybrid promoters that
combine elements of more than one promoter.

In a preferred embodiment, the expression vector is operable in a Gram-negative
prokaryotic cell. A variety of prokaryotic expression vectors may be used as a basis
for constructing the expression vector of the invention. These include but are not limited to
a chromosomal vector (e.g., a bacteriophage such as bacteriophage λ) and an
extrachromosomal vector (e.g., a plasmid or a cosmid expression vector). The expression
vector will also typically contain an origin of replication, which allows autonomous
replication of the vector, and one or more selectable marker genes that allow phenotypic
selection of the transformed cells.

The expression vector may also include a fusion partner (typically provided by the
expression vector) so that a recombinant polypeptide is expressed as a fusion polypeptide
with said fusion partner. The main advantage of fusion partners is that they assist
identification and/or purification of said fusion polypeptide. In order to express said fusion
polypeptide, it is necessary to ligate a polynucleotide according to the invention into the
expression vector so that the translational reading frames of the fusion partner and the
polynucleotide coincide. Well known examples of fusion partners include, but are not
limited to, glutathione-S-transferase (GST), Fc portion of human IgG, maltose binding
protein (MBP) and hexahistidine (HIS6), which are particularly useful for isolation of the
fusion polypeptide by affinity chromatography. For the purposes of fusion polypeptide
purification by affinity chromatography, relevant matrices for affinity chromatography are
glutathione-, amylose-, and nickel- or cobalt-conjugated resins respectively. Many such
matrices are available in "kit" form, such as the QIAexpress™ system (Qiagen) useful with
(HIS6) fusion partners and the Pharmacia GST purification system. In a preferred
embodiment, the recombinant polynucleotide is expressed in the commercial vector
pFLAG as described more fully hereinafter. Another fusion partner well known in the art is
green fluorescent protein (GFP). This fusion partner serves as a fluorescent "tag" which
allows the fusion polypeptide of the invention to be identified by fluorescence microscopy
or by flow cytometry. The GFP tag is useful when assessing subcellular localisation of the
fusion polypeptide of the invention, or for isolating cells which express the fusion
polypeptide of the invention. Flow cytometric methods such as fluorescence activated cell
sorting (FACS) are particularly useful in this latter application. Preferably, the fusion
partners also have protease cleavage sites, such as for Factor Xa or Thrombin, which allow
the relevant protease to partially digest the fusion polypeptide of the invention and thereby
liberate the recombinant polypeptide of the invention therefrom. The liberated polypeptide
can then be isolated from the fusion partner by subsequent chromatographic separation.
Fusion partners according to the invention also include within their scope "epitope tags",
which are usually short peptide sequences for which a specific antibody is available. Well
known examples of epitope tags for which specific monoclonal antibodies are readily
available include c-Myc, influenza virus haemagglutinin and FLAG tags.

Preferred host cells for purposes of selecting vector components for expression
vectors of the present invention include fungal host cells such as yeast and prokaryotic host
cells such as E. coli and X. albilineans, but mammalian cell cultures can also be used. In
hosts such as yeasts, plants, or mammalian cells that ordinarily do not produce modular
polyketide synthase enzymes, it may be necessary to provide, also typically by
recombinant means, suitable holo-ACP synthases to convert the recombinantly produced
PKS to functionality.

The expression vector may be used to transform the desired host cell to produce a
recombinant host cell for producing inter alia a recombinant polypeptide or polyketides,
particularly albicidins or analogues thereof, as described hereinafter.

5. Methods of preparing the polypeptides of the invention 

Polypeptides of the invention, including the full-length parent polypeptides
described in Section 2.1, or their biologically active fragments comprising, for example,
one or more domains (or fragments of such domains), or variants or derivatives of these,
may be prepared by any suitable procedure known to those of skill in the art. For example,
the polypeptides may be prepared by a procedure including the steps of:

(a) preparing a recombinant polynucleotide comprising a nucleotide sequence
encoding a polypeptide comprising the sequence set forth in any one of SEQ ID NO: 4
or a fragment thereof comprising at least one sequence selected from the group
consisting of SEQ ID NO: 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36,
38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 83,
87, 89, 91, 93, 95, 99, 101, 103, 105 and 107, or variant or derivative of these, which
nucleotide sequence is operably linked to a regulatory polynucleotide;

(b) introducing the recombinant polynucleotide into a suitable host cell;

(c) culturing the host cell to express recombinant polypeptide from said 
recombinant polynucleotide; and 

(d) isolating the recombinant polypeptide. 

Suitably, said nucleotide sequence comprises at least one sequence selected from 
the group consisting of SEQ ID NO: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31,
33, 35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 
82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102 and 104. 

The recombinant polynucleotide is preferably in the form of an expression vector,
which includes a self-replicating extra-chromosomal vector such as a plasmid, or a vector
that integrates into a host genome, as for example described above in Section 4. The step of
introducing the recombinant polynucleotide into the host cell may be effected by any
suitable means including transfection and transformation, the choice of which will be
dependent on the host cell employed. Such methods are well known to those of skill in the
art.

Recombinant polypeptides of the invention may be produced by culturing a host
cell transformed with an expression vector containing nucleic acid encoding a polypeptide,
biologically active fragment, variant or derivative according to the invention. The
conditions appropriate for protein expression will vary with the choice of expression vector
and the host cell. This is easily ascertained by one skilled in the art through routine
experimentation.

Suitable host cells for expression may be prokaryotic or eukaryotic. One preferred
host cell for expression of a polypeptide according to the invention is a bacterium. The
bacterium used may be Escherichia coli. Alternatively, the host cell may be an insect cell
such as, for example, SF9 cells that may be utilised with a baculovirus expression system.

The recombinant protein may be conveniently prepared by a person skilled in the
art using standard protocols as for example described in Sambrook et al., MOLECULAR
CLONING. A LABORATORY MANUAL (Cold Spring Harbor Press, 1989), in particular
Sections 16 and 17; Ausubel et al., CURRENT PROTOCOLS IN MOLECULAR
BIOLOGY (John Wiley & Sons, Inc. 1994-1998), in particular Chapters 10 and 16; and
Coligan et al., CURRENT PROTOCOLS IN PROTEIN SCIENCE (John Wiley & Sons,
Inc. 1995-1997), in particular Chapters 1, 5 and 6.

Alternatively, the polypeptide, fragments, variants or derivatives of the invention 
may be synthesised using solution synthesis or solid phase synthesis as described, for 
example, in Chapter 9 of Atherton and Shephard (supra) and in Roberge et al. (1995,
Science 269: 202). 

6. Antigen-binding molecules 

The invention also contemplates antigen-binding molecules that bind specifically 
to the aforementioned polypeptides, fragments, variants and derivatives. Preferably, an 
antigen-binding molecule according to the invention is immuno-interactive with any one or
more of the amino acid sequences set forth in SEQ ID NO: 4, 6, 8, 10, 12, 14, 16, 18, 20, 
22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 
70, 72, 74, 76, 78, 80, 83, 87, 89, 91, 93, 95, 99, 101, 103, 105 and 107, or variants thereof. 

For example, the antigen-binding molecules may comprise whole polyclonal 
antibodies. Such antibodies may be prepared, for example, by injecting a polypeptide,
fragment, variant or derivative of the invention into a production species, which may 
include mice or rabbits, to obtain polyclonal antisera. Methods of producing polyclonal 
antibodies are well known to those skilled in the art. Exemplary protocols which may be 
used are described for example in Coligan et al, CURRENT PROTOCOLS IN 
IMMUNOLOGY, (John Wiley & Sons, Inc, 1991), and Ausubel et al., (1994-1998, supra),
in particular Section III of Chapter 11. 

In lieu of the polyclonal antisera obtained in the production species, monoclonal 
antibodies may be produced using the standard method as described, for example, by 
Kohler and Milstein (1975, Nature 256, 495-497), or by more recent modifications thereof 
as described, for example, in Coligan et al., (1991, supra) by immortalising spleen or other
antibody producing cells derived from a production species which has been inoculated with
one or more of the polypeptides, fragments, variants or derivatives of the invention. 

The invention also contemplates as antigen-binding molecules Fv, Fab, Fab' and
F(ab')2 immunoglobulin fragments. Alternatively, the antigen-binding molecule may be in
the form of a synthetic stabilised Fv (scFv) fragment, a disulphide stabilised Fv (dsFv)
fragment, a diabody (dAb), a minibody and the like, or may comprise non-immunoglobulin
derived, protein frameworks. The antigen-binding molecules of the invention may be used
for affinity chromatography in isolating a natural or recombinant polypeptide or
biologically active fragment of the invention. For example, reference may be made to
immunoaffinity chromatographic procedures described in Chapter 9.5 of Coligan et al.,
(1995-1997, supra). The antigen-binding molecules can be used to screen expression
libraries for variant polypeptides of the invention as described herein. They can also be
used to detect polypeptides, fragments, variants and derivatives of the invention as
described hereinafter.

7. Identification of modulators

The invention also contemplates a method of screening for an agent that
modulates the expression of a gene selected from xabB, xabA, or xabC, or a gene
belonging to the same regulatory or biosynthetic pathway as xabB, xabA, or xabC, or a
variant of that gene, or that modulates the level and/or functional activity of an expression
product of that gene or its variant. The method comprises contacting a preparation
comprising said expression product (e.g., polypeptide or transcript), or a biologically active
fragment thereof, or variant or derivative of these, or a genetic sequence that modulates the
expression of said gene (e.g., the natural promoter relating to said gene, e.g., the xabB
promoter, comprising the sequence set forth in SEQ ID NO: 81 or complement thereof),
with a test agent, and detecting a change in the level and/or functional activity of said
polypeptide or biologically active fragment thereof, or variant or derivative, or of a product
expressed from said genetic sequence.

Modulators contemplated by the present invention include agonists and
antagonists of gene expression. Antagonists of gene expression include antisense
molecules, ribozymes and co-suppression molecules, as for example described in Section 2.
Agonists include molecules which increase promoter activity or interfere with negative
mechanisms. Agonists of a gene include molecules which overcome any negative
regulatory mechanism. Antagonists of polypeptides encoded by a gene of interest include
antibodies and inhibitor peptide fragments.

Candidate agents encompass numerous chemical classes, though typically they are
organic molecules, preferably small organic compounds having a molecular weight of
more than 50 and less than about 2,500 Daltons. Candidate agents comprise functional
groups necessary for structural interaction with proteins, particularly hydrogen bonding,
and typically include at least an amine, carbonyl, hydroxyl or carboxyl group, preferably at
least two of the functional chemical groups. The candidate agents often comprise cyclical
carbon or heterocyclic structures and/or aromatic or polyaromatic structures substituted
with one or more of the above functional groups. Candidate agents are also found among
biomolecules including, but not limited to: peptides, saccharides, fatty acids, steroids,
purines, pyrimidines, derivatives, structural analogues or combinations thereof.

Small (non-peptide) molecule modulators of a polypeptide according to the 
invention, or portion, or domain or module thereof are particularly preferred. In this regard,
small organic molecules typically have the ability to gain entry into an appropriate cell and 
affect the expression of a gene (e.g., by interacting with the regulatory region or 
transcription factors involved in gene expression); or affect the activity of a gene by 
inhibiting or enhancing the binding of accessory molecules. 

Alternatively, libraries of natural compounds in the form of bacterial, fungal, plant
and animal extracts are available or readily produced. Additionally, natural or synthetically
produced libraries and compounds are readily modified through conventional chemical,
physical and biochemical means, and may be used to produce combinatorial libraries.
Known pharmacological agents may be subjected to directed or random chemical
modifications, such as acylation, alkylation, esterification, amidification, etc. to produce
structural analogues. Screening may also be directed to known pharmacologically active
compounds and chemical analogues thereof.

Screening for modulatory agents according to the invention can be achieved by 
any suitable method. For example, the method may include contacting a cell comprising a 
polynucleotide corresponding to a gene as defined above, with an agent suspected of
having said modulatory activity and screening for the modulation of the level and/or
functional activity of a protein encoded by said polynucleotide, or the modulation of the
level of an expression product encoded by the polynucleotide, or the modulation of the
activity or expression of a downstream cellular target of said protein or said expression
product. Detecting such modulation can be achieved utilising techniques including, but not
restricted to, ELISA, cell-based ELISA, filter-binding ELISA, inhibition ELISA, Western
blots, immunoprecipitation, slot or dot blot assays, immunostaining, RIA, scintillation
proximity assays, fluorescent immunoassays using antigen-binding molecule conjugates or
antigen conjugates of fluorescent substances such as fluorescein or rhodamine,
Ouchterlony double diffusion analysis, immunoassays employing an avidin-biotin or a
streptavidin-biotin detection system, and nucleic acid detection assays including reverse
transcriptase polymerase chain reaction (RT-PCR).

It will be understood that a polynucleotide from which a target molecule of 
interest is regulated or expressed may be naturally occurring in the cell which is the subject 
of testing or it may have been introduced into the host cell for the purpose of testing. 

Further, the naturally-occurring or introduced sequence may be constitutively expressed -
thereby providing a model useful in screening for agents which down-regulate expression
of an encoded product of the sequence wherein said down regulation can be at the nucleic
acid or expression product level - or may require activation - thereby providing a model
useful in screening for agents that up-regulate expression of an encoded product of the
sequence. Further, to the extent that a polynucleotide is introduced into a cell, that
polynucleotide may comprise the entire coding sequence which codes for a target
polypeptide or it may comprise a portion of that coding sequence (e.g. a domain or module
as herein described) or a portion that regulates expression of a product encoded by the
polynucleotide (e.g., a promoter). For example, the promoter that is naturally associated
with the polynucleotide (i.e. the xabB promoter) may be introduced into the cell that is the
subject of testing. In this regard, where only the promoter is utilised, detecting modulation
of the promoter activity can be achieved, for example, by operably linking the promoter to
a suitable reporter polynucleotide including, but not restricted to, green fluorescent protein
(GFP), luciferase, β-galactosidase and chloramphenicol acetyl transferase (CAT). Modulation
of expression may be determined by measuring the activity associated with the reporter
polynucleotide.

In another example, the subject of detection could be a downstream regulatory or
biosynthetic target of the target molecule, rather than the target molecule itself or the reporter
molecule operably linked to a promoter of a gene encoding a product the expression of
which is regulated by the target protein.

These methods provide a mechanism for performing high throughput screening of
putative modulatory agents such as proteinaceous or non-proteinaceous agents comprising
synthetic, combinatorial, chemical and natural libraries. These methods will also facilitate
the detection of agents which either bind the polynucleotide encoding the target molecule
or modulate the expression of an upstream molecule, which subsequently modulates
the expression of the polynucleotide encoding the target molecule. Accordingly, these
methods provide a mechanism of detecting agents that either directly or indirectly
modulate the expression and/or activity of a gene or expression product according to the
invention.

8. Production of secondary metabolites 

The present invention further relates to a process for enhancing the level and/or
functional activity of secondary metabolites, preferably albicidins, using one or more
agents selected from the polynucleotides, polypeptides, fragments, variants, derivatives,
vectors and modulatory agents described above. The process, in a preferred embodiment,
includes the steps of stably transforming a host cell with an expression vector as broadly
described above, comprising at least one nucleic acid sequence encoding a polypeptide of
the invention or a biologically active fragment or variant or derivative of these and
isolating transformants which produce an enhanced amount of antibiotics, which are
preferably of the albicidin class. The vector optionally comprises a signal sequence for
secretion recognised by the host cell. Illustrative secretory leaders include the secretory
leaders of penicillinase, α-factor, immunoglobulin, T-cell receptors, outer membrane
proteins, glucoamylase, fungal amylase and the like. By fusion in proper reading frame, the
mature polypeptide may be secreted into the medium. The host cell may be a eukaryote or
a prokaryote cell. In one embodiment, the cell naturally produces polyketides, preferably
antibiotic polyketides and, in this regard, the cell is preferably X. albilineans or other
bacteria capable of producing albicidins. Optionally, the construct may include a
transcription regulating sequence, which is not subject to repression by substances present
in the growth medium. The above process may be used to prepare antibiotics directly or
it may be used to prepare cell free extracts containing increased quantities of antibiotics,
preferably of the albicidin class, for in vitro preparation of said antibiotics. Suitably, these
cell free extracts may be prepared for example using the method disclosed by Dobrogosz,
W.J. (1981) Enzymatic activity. In Manual of Methods for General Bacteriology
(Gerhardt, P., ed) Washington, DC: American Society for Microbiology, pp. 365-392. In a
preferred embodiment, a vector from which a phosphopantetheinyl transferase (PPTase)
can be translated is also introduced into the host cell. Expression of PPTase
polynucleotides has been shown to be important for the production of polyketides in
heterologous expression systems. Preferably, the PPTase is selected from EntD and/or
XabA as for example disclosed herein. If desired, a vector from which a methyltransferase,
more preferably an O-methyltransferase, and even more preferably an
S-adenosylmethionine O-methyltransferase, can be translated may also be introduced into the
host cell. An exemplary methyltransferase for this purpose is XabC as described herein.

Alternatively, the expression hosts may be used as a source of increased quantities
of antibiotics, which can be subsequently purified as for example disclosed by Birch et al.
in U.S. Patent No. 4,525,354.

The invention also contemplates use of the polynucleotides, polypeptides,
fragments, variants and derivatives of the invention in methods of combinatorial
biosynthesis of novel antibiotics as for example disclosed by Khosla et al. in U.S. Patent
No. 5,712,146, Peterson et al. in U.S. Patent No. 5,783,431 and Betlach et al. in U.S.
Patent No. 6,251,636 or in methods of producing antibiotics in hosts that ordinarily do not
produce them as for example disclosed by Barr et al. in U.S. Patent No. 6,033,883. As
discussed in Section 2.4, the invention contemplates albicidin PKS-NRPS derivatives with
altered activities in one or more respects for the production of polyketides other than the
albicidin natural product(s) of XabB. In this regard, expression vectors containing
nucleotide sequences encoding a variety of such derivatives for the production of different
polyketides are transformed into the appropriate host cells to construct a library. In one
embodiment, a mixture of such vectors is transformed into selected host cells and the
resulting cells plated into individual colonies and selected to identify successful
transformants. A variety of strategies is available to obtain a multiplicity of colonies each
containing a PKS gene cluster derived from the naturally occurring host gene cluster so
that each colony in the library produces a different PKS and ultimately a different
polyketide, as for example disclosed by Betlach et al. in U.S. Patent No. 6,251,636. The
libraries thus produced can be considered at four levels: (1) a multiplicity of colonies each
with a different PKS-NRPS encoding sequence; (2) the proteins produced from the coding
sequences; (3) the polyketides produced from the proteins assembled into a functional
PKS-NRPS; and (4) antibiotics or compounds with other desired activities derived from
the polyketides. Colonies in the library can be induced to produce the relevant synthases
and thus to produce the relevant polyketides to obtain a library of polyketides. Polyketides
that are secreted into the media or have been otherwise isolated can be screened for
binding to desired targets, such as receptors, signalling proteins, and the like. The
supernatants per se can be used for screening, or partial or complete purification of the
polyketides can first be effected. Typically, such screening methods involve detecting the
binding of each member of the library to a receptor or other target ligand. Binding can be
detected either directly or through a competition assay. Means to screen such libraries for
binding are well known in the art. Alternatively, individual polyketide members of the
library can be tested against a desired target. In this event, screens wherein the biological
response of the target is measured can more readily be included. Antibiotic activity can be
verified using typical screening assays such as those for albicidin set forth in Example 1.

The invention also extends to the use of the polynucleotides, polypeptides,
fragments, variants and derivatives of the invention for the synthesis of antibiotics,
preferably antibiotics of the albicidin class.

The polynucleotides of the invention encoding XabB, or a biologically-active
fragment or variant thereof, together with a recombinant polynucleotide encoding a PPTase
and/or an O-methyltransferase which participate or which are capable of participating in
the albicidin biosynthetic pathway, provide the means to engineer high level co-expression
of the albicidin synthetase, its activating PPTase and modifying methyltransferase to obtain
higher yields of albicidins.

In order that the invention may be readily understood and put into practical effect,
particular preferred embodiments will now be described by way of the following
non-limiting examples.

EXAMPLES
EXAMPLE 1

Albicidin multifunctional synthase gene

Materials and Methods

Bacterial strains and plasmids

The properties of bacteria and plasmids used in this example are listed in Table 1.



Media, culture conditions and antibiotics 

X. albilineans strains were routinely cultured on SP medium (Birch & Patil,
1985b) at 28° C. Escherichia coli DH5α and JM109 were used as hosts in cloning
experiments and were grown on LB medium at 37° C (Sambrook et al., 1989). Broth
cultures were aerated by shaking at 200 r.p.m. on an orbital shaker. Modified YEB medium
(Van Larebeke et al., 1977) for patch mating consisted of 10 mg mL⁻¹ peptone, 5 mg mL⁻¹
yeast extract, 5 mg mL⁻¹ NaCl, 5 mg mL⁻¹ sucrose and 0.5 mg mL⁻¹ MgSO4.7H2O. The
following antibiotics were added to media as required: 50 µg kanamycin mL⁻¹; 15 µg
tetracycline mL⁻¹; 100 µg ampicillin mL⁻¹.



Routine genetic procedures 

Bacterial genomic DNA and plasmid DNA isolation, gel electrophoresis, DNA
restriction digests, ligation reactions and transformation were performed by routine
procedures (Sambrook et al., 1989). DNA fragments were excised from agarose gels and
residual agarose was removed with the BRESAclean™ DNA purification kit (GeneWorks,
Adelaide).

Construction of a X. albilineans partial genomic library 

Genomic DNA from X. albilineans Xa13 was digested with EcoRI and size-fractionated.
DNA fragments of 15 to 20 kb were ligated to dephosphorylated EcoRI-cleaved
pBluescript II SK. The ligated DNA was electroporated into E. coli TOP10.
Transformants were selected on LB agar medium containing ampicillin, and stored in LB
broth with 15% glycerol at -70°C.

PCR amplification 

BamHI-digested genomic DNA from X. albilineans LS157 was religated at low
concentration (0.5 µg/mL) to generate circular DNA molecules as templates for inverse
PCR. Three primers, one from the IS terminal region of Tn5 (IR2:
5'-CGGGATCCTCACATGGAAGTCAGATCCTG-3'), and two flanking the unique BamHI
restriction site of Tn5 (BL: 5'-GGGGACCTTGCACAGATAGC-3', and BR:
5'-CATTCCTGTAGCGGATGGAGATC-3'), were used to amplify the sequences flanking
the Tn5 insertion in the genome of LS157. The amplified fragments (1.4-kb and 6.0-kb)
were cloned into pZErO-2, yielding pZIL and pZIR (Figure 1).

PCR was performed in a volume of 50 µL with 200 ng of genomic DNA (or 10 ng
of plasmid DNA), 0.4 ng/µL of each primer, 0.2 mM of each dNTP, 1.8 mM Mg2+, and
1 unit of elongase enzyme mix (Life Technologies). A 10-min initial denaturation step at
94° C was followed by 35 thermal cycles of denaturation at 94° C for 1 min, annealing at
55° C for 1 min, and extension at 72° C for 1 min per 1 kb of expected amplification product.
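
As a worked illustration of the cycling conditions just described, the following Python sketch computes the extension time and total programmed time for an amplicon of a given length. The function name and the layout of the returned dictionary are illustrative assumptions, and ramp times between temperature steps are ignored.

```python
def pcr_program(amplicon_kb, initial_denat_min=10.0, cycles=35):
    """Return step durations (in minutes) for the cycling scheme described above.

    Assumes 1 min denaturation, 1 min annealing and 1 min of extension
    per kb of expected product in every cycle (hypothetical helper).
    """
    denat_min = 1.0
    anneal_min = 1.0
    extend_min = 1.0 * amplicon_kb  # 1 min per 1 kb of expected product
    per_cycle_min = denat_min + anneal_min + extend_min
    return {
        "extension_min": extend_min,
        "per_cycle_min": per_cycle_min,
        "total_min": initial_denat_min + cycles * per_cycle_min,
    }


if __name__ == "__main__":
    # The two inverse-PCR products reported above were 1.4 kb and 6.0 kb.
    for kb in (1.4, 6.0):
        program = pcr_program(kb)
        print(f"{kb} kb: {program['extension_min']:.1f} min extension, "
              f"{program['total_min']:.0f} min in total")
```

For the 1.4-kb and 6.0-kb fragments this gives extension steps of 1.4 min and 6 min per cycle respectively.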

Construction of promoter probes and glucuronidase assay 

Plasmid pRG960sd contains a promoterless β-glucuronidase gene (uidA)
downstream of a multiple cloning site (Van den Eede et al., 1992). Sequence upstream of
xabB (nucleotide residues 1005 to 1210 or 521 to 1210) was amplified from pLXABB by
PCR. Forward primer P1F1 (5'-ACGCGGATCCCAGCAGGGTGTCATACACG-3') or
P1F2 (5'-TCGCGGATCCGCGCGATTGAAGTAGTCC-3') contained a BamHI
restriction site (GGATCC). Reverse primer P1R
(5'-TCCCCCGGGCGGCCAGCGTGGTGCTACTAC-3') introduced an XmaI restriction site
(CCCGGG). PCR fragments were ligated into BamHI/XmaI-cut pRG960sd, yielding
pRG960p1 and pRG960p2. These constructs were mobilised from E. coli DH5α into X.
albilineans LS155 as described below.
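
A quick way to check that each primer carries the intended cloning site is to search its sequence for the enzyme's recognition sequence. The sketch below does this for the three promoter-probe primers; the primer sequences are taken from the text with the OCR spacing removed, and the helper name and dictionaries are hypothetical. Both recognition sequences are palindromic, so searching one strand is sufficient.

```python
# Recognition sequences of the two enzymes used for cloning into pRG960sd.
RECOGNITION_SITES = {"BamHI": "GGATCC", "XmaI": "CCCGGG"}

# Primer sequences as given in the text (5' to 3').
PRIMERS = {
    "P1F1": "ACGCGGATCCCAGCAGGGTGTCATACACG",
    "P1F2": "TCGCGGATCCGCGCGATTGAAGTAGTCC",
    "P1R": "TCCCCCGGGCGGCCAGCGTGGTGCTACTAC",
}


def find_site(primer, enzyme):
    """Return the 0-based index of the enzyme's site in the primer, or -1 if absent."""
    return primer.upper().find(RECOGNITION_SITES[enzyme])


if __name__ == "__main__":
    for name, seq in PRIMERS.items():
        for enzyme in RECOGNITION_SITES:
            print(name, enzyme, find_site(seq, enzyme))
```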

Promoter strength was quantified by fluorometric analysis of glucuronidase
activity (Jefferson, 1987; Xiao et al., 1992). The protein content in cell lysates was
determined by the dye-binding method (Bradford, 1976) using a Bio-Rad protein assay kit.

Bacterial conjugation 

DNA transfer between E. coli donor (JM109 pLAFR3 ± insert, or DH5α
pRG960sd ± insert) and X. albilineans recipient (LS157 or LS155) was accomplished by
triparental transconjugation with helper strain pRK2013. Mid-log-phase cultures of the
recipient were spotted onto agar plates containing YEB medium with no antibiotics (20 µL
per spot). After the liquid was absorbed by the agar, 20 µL of mid-log-phase culture of the
helper was added to each spot. The liquid was again allowed to absorb, and 20 µL of
mid-log-phase culture of the donor was added to each spot. After incubation of the mating
plates overnight at 28° C, transconjugants were selected on SP plates supplemented with
ampicillin, and tetracycline or spectinomycin.

Assay and quantification of albicidin production 

Albicidin was quantified by a microbial plate bioassay as described previously
(Birch and Patil, 1985b), except that the 10 mL basal layer of LB agar and the 5 mL
overlayer of 50% LB with 1% agar were supplemented with tetracycline or spectinomycin,
and E. coli DH5α pLAFR3 or pRG960sd was used as the indicator strain. This change
avoided interference by tetracycline or spectinomycin, which were added to some cultures
to ensure retention of pLAFR3 or pRG960sd derivatives in X. albilineans. Inhibition zone
widths in the bioassay were converted to albicidin concentrations by interpolation on a
dose-response plot produced under the same assay conditions. The plot fits the formula:
Log [Alb] = 0.3 W - 0.92, where [Alb] is units of albicidin per 20 µL sample assayed, and
W is the width in millimetres of the zone of growth inhibition surrounding each well.
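
The conversion from zone width to albicidin units can be sketched in a few lines of Python. This is a minimal illustration that assumes "Log" in the formula denotes the base-10 logarithm; the function names are hypothetical, and the fit should only be applied within the range covered by the dose-response plot.

```python
import math

SLOPE = 0.3        # coefficient of W in the fitted formula
INTERCEPT = -0.92  # constant term in the fitted formula


def albicidin_units(zone_width_mm):
    """Units of albicidin per 20 µL sample for a given inhibition zone width (mm).

    Assumes Log [Alb] = 0.3 W - 0.92 uses a base-10 logarithm.
    """
    return 10 ** (SLOPE * zone_width_mm + INTERCEPT)


def zone_width_mm(units):
    """Inverse of albicidin_units: zone width (mm) expected for a given number of units."""
    return (math.log10(units) - INTERCEPT) / SLOPE


if __name__ == "__main__":
    for width in (3.0, 5.0, 8.0):
        print(f"W = {width} mm -> {albicidin_units(width):.2f} units per 20 µL")
```

Under this reading, each additional millimetre of zone width corresponds to roughly a twofold increase in albicidin (a factor of 10^0.3).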

Results

Cloning and sequencing of xabB gene required for albicidin production

Xanthomonas albilineans Tox⁻ mutant LS157 contains a single Tn5 insertion, in a
4.1 kb ClaI restriction fragment or a 16.5 kb EcoRI restriction fragment (Figure 1).
Selection for kanamycin resistance, following shotgun cloning of ClaI restriction
fragments of LS157 DNA into pBluescript II SK, yielded clone pBC157. Sequences
flanking the Tn5 insertion in LS157 DNA were amplified by inverse PCR, and cloned into
pZErO-2, producing pZIL and pZIR. Plasmid pLXABB was isolated from an X.
albilineans Xa13 EcoRI genomic library by screening with probes described in Figure 1B.
Subclones pSEBL and pSEBR were derived from pLXABB (Figure 1C, Table 1).

The double-strand sequence of the 16,511 bp EcoRI genomic fragment in
pLXABB was obtained by a primer-walking approach, using subclones pBC157, pZIL,
pZIR, pSEBL, and pSEBR. The Tn5 insertion in the genome of LS157 is accompanied by
a 9-bp perfect repeat sequence (GTCCTGAAG), commencing at 2490 bp in GenBank
accession no. AF239749.

The only ORF longer than 900 bp within the 16.5-kb fragment is disrupted by the
Tn5 insertion. This ORF (designated xabB) encodes a protein of 4081 aa (Mr 525,695). It
commences at 1230 bp in GenBank accession no. AF239749 with a TTG codon, 6 bp
downstream from a ribosome binding sequence (RBS) GAGG, which may impose
post-transcriptional control on the rate of gene product formation (McCarthy and Gualerzi,
1990). There is an alternative start codon (ATG) a further 15 bp downstream. Of the
codons in this ORF, 8.5% are rarely used in E. coli. The closest match
(TTGAGC-14x-TATAAC) to the consensus -35 (TTGACA) and -10 (TATAAT) sequences for
E. coli σ70 promoters occurs 117 bp upstream of the translation initiation codon (Figure 2).
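
How close the candidate hexamers are to the consensus can be made explicit by counting matching positions. The sketch below is illustrative only; the helper name is hypothetical, and it simply compares the hexamers quoted in the text position by position.

```python
def matching_positions(candidate, consensus):
    """Count positions at which two equal-length sequences carry the same base."""
    if len(candidate) != len(consensus):
        raise ValueError("sequences must have the same length")
    return sum(a == b for a, b in zip(candidate.upper(), consensus.upper()))


if __name__ == "__main__":
    # Hexamers quoted in the text: xabB candidate versus E. coli consensus.
    print("-35:", matching_positions("TTGAGC", "TTGACA"), "of 6")
    print("-10:", matching_positions("TATAAC", "TATAAT"), "of 6")
```

By this count the candidate -35 element matches the consensus at 4 of 6 positions and the candidate -10 element at 5 of 6.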

Downstream by 35 bp from the TAG stop codon of xabB is a probable RBS
(GAGG), separated by 6 bp from the ATG start codon of another ORF (designated xabC)
in the same orientation as xabB. Overlapping the xabB promoter region is another probable
promoter for a divergent transcript including a putative RBS (TGGAGG) and start codon
for a gene designated xatA, separated by 233 bp from xabB (Figures 1 and 2).

Complementation of xabB gene in LS157

Mobilisation of pLAFR3, pLXABB1 or pLXABB2 by bacterial conjugation into
Tox⁻ mutant LS157 occurred at a frequency of 1.5 × 10⁻² transconjugants per recipient cell.
Albicidin production was undetectable in Tox⁻ mutant LS157 and LS157 (pLAFR3)
controls, but introduction of the xabB gene on pLXABB1 or pLXABB2 restored albicidin
production to the level of the wild-type parental strain LS155 (Figure 4).



Functional analysis of xabB promoter region 

GUS activity was undetectable in LS155 and LS155 (pRG960sd) controls.
Plasmids pRG960p1 and pRG960p2, with 206 bp or 690 bp from the xabB promoter region
upstream of GUS, both conferred GUS activity with no difference in expression level or
pattern in X. albilineans LS155 (Figure 5).



Discussion 

Albicidin was partially characterised as a low-molecular-weight compound that
contains 38 carbon atoms with 3-4 aromatic rings (Birch and Patil, 1985a). The compound
is not degraded by peptidases (Birch and Patil, 1985a), but it is cleaved by the AlbD
esterase (Zhang and Birch, 1997). Based on the deduced functionality of the synthase
described herein, albicidin is likely to be a complex polyketide, condensed with amino
acid(s), or nonproteinogenic amino, hydroxyl and carboxyl acid(s) by C-N, amide or ester
bond formation.

The characterisation of XabB as a multi-modular hybrid enzyme provides new
insights into the mechanism of albicidin biosynthesis and possible approaches to engineer
the overproduction of albicidins. For example, the complementation experiments (Figure
4) indicate that increased copy number of xabB stimulates early production of albicidin,
but other factors become limiting during idiophase. It may be possible to increase
expression of the albicidin synthase by modifications to the promoter and TTG start codon,
or to improve albicidin yields by supplying candidate substrates (such as shikimate-derived
units). The unusual enzyme organisation also contributes to the emerging understanding of
how microbes generate structural diversity of antibiotics, and can facilitate combinatorial
engineering of antibiotics of mixed peptide/polyketide origin.

EXAMPLE 2

Albicidin Antibiotic and Phytotoxin Biosynthesis in Xanthomonas albilineans Requires a
Phosphopantetheinyl Transferase Gene

Materials and Methods

Bacterial strains and plasmids

The properties of bacteria and plasmids used in this Example are listed in Table 3.

Media, culture conditions and antibiotics 

X. albilineans strains were routinely cultured on SP medium (Birch & Patil,
1985b) at 28° C. Escherichia coli DH5α and JM109 were used as hosts in cloning
experiments and were grown on LB medium at 37° C (Sambrook et al., 1989). Broth
cultures were aerated by shaking at 200 r.p.m. on an orbital shaker. Modified YEB medium
(Van Larebeke et al., 1977) for patch mating consisted of 10 mg mL⁻¹ peptone, 5 mg mL⁻¹
yeast extract, 5 mg mL⁻¹ NaCl, 5 mg mL⁻¹ sucrose and 0.5 mg mL⁻¹ MgSO4.7H2O. The
following antibiotics were added to media as required: 50 µg kanamycin mL⁻¹; 15 µg
tetracycline mL⁻¹; 100 µg ampicillin mL⁻¹.

Assay of albicidin production

Albicidin was quantified by a microbial plate bioassay as described previously
(Birch and Patil, 1985b), except that the 10 mL basal layer of LB agar and the 5 mL
overlayer of 50% LB with 1% agar were supplemented with tetracycline, and E. coli
DH5α [pLAFR3] was used as the indicator strain. This change avoided interference by
tetracycline, which was added to some cultures to ensure retention of pLAFR3 derivatives
in X. albilineans.

Routine genetic procedures 

Bacterial genomic DNA and plasmid DNA isolation, gel electrophoresis, DNA
restriction digests, ligation reactions and transformation were performed by routine
procedures (Sambrook et al., 1989). DNA fragments were excised from agarose gels and
residual agarose was removed with the BRESAclean™ DNA purification kit (GeneWorks,
Adelaide).

DNA sequencing and analysis 

Sequencing reactions were performed by dideoxynucleotide chain termination
(Sanger et al., 1977) using the BigDye™ Terminator Cycle Sequencing Kit and 373A
DNA sequencer (PE Applied Biosystems) through the Australian Genome Research
Facility. Oligonucleotide primers were purchased from GeneWorks (Adelaide). University
of Wisconsin Genetics Computer Group (UWGCG) programs BLASTP, FASTA, PILEUP,
and BESTFIT were used through WebANGIS version 2.0 for DNA and protein sequence
analyses of the GenBank, EMBL, PIR and SWISSPROT databases using standard defaults.

Cloning of Tn5 flanking sequences 

EcoRI-digested genomic DNA from X. albilineans Tox⁻ mutant LS156 was
ligated into pBluescript II SK and electroporated into E. coli DH5α. Transformants were
selected on LB medium containing kanamycin and ampicillin, yielding clone pBEAl, from
which subclones pCEAl and pPEAl were obtained (Figure 1).

Amplification of sequences from wild-type LS155 by PCR

Sequences flanking the Tn5 insertion in LS156 were used to design primers (A1F:
5'-TTTGGGTTGGATCGGGTAG-3' and A1R: 5'-CCTTCTCGTCCTTGCTCTTC-3')
for PCR-amplification of the corresponding wild type X. albilineans LS155 chromosomal
DNA. PCR was performed in a volume of 50 µL with 200 ng of genomic DNA, 0.4 ng
µL⁻¹ of each primer, 0.2 mM of each dNTP, 1.8 mM Mg2+, and 1 unit of elongase
enzyme mix (Life Technologies). A 4-min initial denaturation step at 94° C was followed
by 35 thermal cycles of denaturation at 94° C for 1 min, annealing at 55° C for 1 min, and
extension at 72° C for 2 min. The amplified DNA fragment was cloned into pGEM-T to
give pGTAl (Figure 1).

Construction of expression vectors 

The coding region of the xabA gene was amplified from pGTAl by PCR. Primer
A1F1 (5'-GGAATTCCATGCCCAATGCCGTACCG-3') contained an EcoRI restriction
site (GAATTC) for insertion of the amplified gene into the correct reading frame of lacZ
in pLAFR3. Primer A1R1 (5'-CGGGATCCCGTGCTCACCAGGCGTAGTGG-3')
introduced a BamHI restriction site (GGATCC), 5 bases downstream from the stop codon
of the amplified gene. The amplified DNA fragment was digested with EcoRI and BamHI,
and ligated with EcoRI/BamHI-digested pLAFR3 to result in pLXABA.

Similarly, the coding region of the entD gene was PCR-amplified from E. coli
DH5α by colony PCR using primers EntDF
(5'-TCCCGGAATTCCATGGTCGATATGAAAACTACGC-3') and EntDR
(5'-GCCCAAGCTTCTAATCGTGTTGGCACAGCGTTATG-3'), then ligated into pLAFR3
to produce pLENTD. The inserts in pLXABA and pLENTD were sequenced to confirm
the expected clones.

Bacterial triparental mating 

DNA transfer between E. coli donor (JM109 pLAFR3 ± insert) and X. albilineans
recipient (LS155 or LS156) was accomplished by triparental transconjugation with helper
strain pRK2013. The mid-log-phase cultures of the recipient were spotted onto agar plates
containing YEB medium with no antibiotics (20 µL per spot). After the liquid was
absorbed by the agar, 20 µL of mid-log-phase culture of the helper was added to each spot.
The liquid was again allowed to absorb, and 20 µL of mid-log-phase culture of the donor
was added to each spot. After incubation of the mating plates overnight at 28° C,
transconjugants were selected on SP plates supplemented with tetracycline and ampicillin.

Results 

Cloning and sequencing of the xabA gene required for albicidin production 

Xanthomonas albilineans Tox⁻ mutant LS156 contains a single Tn5 insertion, in a
3.0-kb EcoRI restriction fragment (Wall & Birch, 1997). Selection for Tn5-encoded
kanamycin resistance, following shotgun cloning of EcoRI restriction fragments of LS156
DNA into pBluescript II SK, yielded pBEAl (Figure 8).

Both strands of the insert in pBEAl excluding the Tn5 insertion were sequenced
by primer-walking from T3 and T7 vector sequences in pBEAl and subclones pCEAl and
pPEAl. The corresponding genomic region was amplified from wild-type X. albilineans
LS155 by PCR, and cloned into pGEM-T to give pGTAl. Sequencing of pGTAl revealed
that a 9-bp imperfect repeat sequence (TTGGCCACG) in the genome of LS156
accompanied the Tn5 insertion (following base number 1869 in Figure 9). The double-strand
nucleotide sequence of the 2989 bp wild type EcoRI fragment is deposited in
GenBank under accession no. AF191324.

Reading frame analysis of the 3 kb EcoRI fragment revealed that only one ORF
(designated xabA) is disrupted by the Tn5 insertion. This ORF encodes a protein of 278 aa
(Mr 29,277), with 6.12% codons rarely used in E. coli. There were no close matches to E.
coli -10 (TATAAT) and -35 (TTGACA) consensus promoter sequences, and no
appropriately spaced RBS sequence (such as AGGA or GAGG) in the region upstream of
the putative start codon ATG (Figure 9). Downstream of the TGA termination codon of
xabA, a region of GC-rich dyad symmetry with a free energy of -10.2 kcal/mol was found,
followed by two TCTC boxes that closely resemble the TCTG consensus sequence
characteristic of many factor-independent termination sites (Brendel & Trifonov, 1984;
Platt, 1986).

Comparison of XabA with other bacterial PPTases 

A search for proteins with homology to the deduced xabA product, using the
FASTA and BLASTP programs and the SWISSPROT database, indicated regions of similarity
to EntD from Escherichia coli (170 aa overlap, 35.9 % identity, 56.5 % similarity), Shigella
flexneri (180 aa overlap, 35.0 % identity, 55.6 % similarity), Salmonella typhimurium (184
aa overlap, 35.9 % identity, 62.0 % similarity), and Salmonella austin (172 aa overlap,
36.1 % identity, 61.1 % similarity). XabA contains (V/I)G(V/I)D and
(F/W)(S/C/T)xKE(S/A)xxK domains characteristic of the phosphopantetheinyl transferase
(PPTase) superfamily, and shares 17-36 % overall identity, 39-62 % overall similarity,
with other bacterial PPTases (Table 4).

Enhanced expression of xabA by complementation in LS156 results in increased
production of albicidins

Mobilisation of pLAFR3 or pLXABA (pLAFR3::xabA) by triparental matings
into Tox⁻ mutant LS156 occurred at a frequency of 1.5 × 10⁻² transconjugants per recipient
cell. Albicidin production was undetectable in Tox⁻ mutant LS156 and LS156 (pLAFR3)
controls, but introduction of the xabA gene on pLXABA restored and enhanced albicidin
production (Figure 10). In LS156 (pLXABA), as in LS155, albicidin was
first detectable in late-log-phase cultures (OD550 = 0.7) and was maximal in stationary
phase. Albicidin production was not responsive to IPTG or glucose, and the lac promoter
driving xabA in pLXABA is considered to express constitutively in X. albilineans. The E.
coli entD gene, expressed from the lac promoter in pLENTD, also complemented the
xabA::Tn5 mutation, restoring albicidin production in LS156.

Discussion 

A gene required for albicidin production in X. albilineans was isolated using a
Tn5 mutagenesis and shotgun cloning approach. The ORF interrupted by Tn5 in Tox⁻
mutant LS156 is designated xabA. This ORF was isolated from Tox⁺ parent strain LS155,
and shown to enhance albicidin production early in the production phase in LS156 when
expressed from the lac promoter in pLAFR3. Tn5 insertions typically cause polar
mutations affecting all downstream cistrons in an operon (De Bruijn and Lupski, 1984).
Complementation of the mutation in LS156 by the isolated xabA ORF indicates the
absence of any downstream cistron involved in albicidin production. There is no consensus
RBS sequence close to the alternative start codons for this ORF in the X. albilineans
genome. Translation may be initiated without an evident ribosome binding sequence
complementary to the 3' end of the 16S rRNA, as observed for some streptomycete genes
involved in secondary metabolism (Strohl, 1992), and for some chloroplast genes (Kozak,
1999).

PPTases play an essential role in priming polyketide, fatty acid, non-ribosomal
peptide and siderophore biosynthesis (Gehring et al., 1997a; Lambalot et al., 1996;
Marahiel et al., 1997; Walsh et al., 1997). All polyketide synthases, fatty acid synthetases,
and non-ribosomal peptide synthetases require post-translational modification to become
catalytically active (Walsh et al., 1997). The inactive apo-proteins are converted to their
active holo-forms by transfer of the 4'-phosphopantetheinyl (P-pant) moiety of coenzyme
A to the sidechain hydroxyl of a serine residue in a conserved carrier domain (Lambalot et
al., 1996; Walsh et al., 1997). The P-pant moiety serves to covalently tether the growing
product, which is assembled by sequential action of multiple catalytic domains in these
complex synthetases (Walsh et al., 1997).

A family of more than twenty PPTases is recognised by a common 
(V/I)G(V/I)D...x40-45...(F/W)(S/C/T)xKE(A/S)xxK signature sequence, but overall 
sequence homologies are low (Gehring et al., 1997; Lambalot et al., 1996; Nakano et al., 
1992; Quadri et al., 1998a). In E. coli, there are two PPTases with distinct specificities: 
ACPS is active on acyl carrier protein (ACP) domains in fatty acid and polyketide 
synthases; EntD is active on peptidyl carrier protein (PCP) and aryl carrier protein (ArCP) 
domains in peptide synthetases (Lambalot et al., 1996; Walsh et al., 1997). Thus, PPTases 
may be partner-protein specific. However, Sfp from B. subtilis appears to be non-specific, 
efficiently activating fatty acid synthases, polyketide synthases and peptide synthetases 
(Kealey et al., 1998; Mofid et al., 1999; Quadri et al., 1998a). XabA includes the PPTase 
VGID and FSxKESxxK motifs. Although it has highest overall similarity to the 
peptide-selective EntD proteins, the sequence groupings are not sufficiently compelling to 
predict the specificity of XabA for polyketide synthases or peptide synthetases (Table 4, 
Figure 11). 
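
The PPTase signature quoted above can be expressed as a regular expression, which makes the motif check on XabA explicit. The following is an illustrative sketch only, not a method used in this study: the function name pptase_motifs is hypothetical, the two XabA segments are the Domain I and Domain II residues listed in Table 4 (truncated to the motif), and the 39-residue spacer is filled with a placeholder residue because the intervening sequence is not reproduced here.

```python
import re

# Two halves of the PPTase signature (V/I)G(V/I)D ... (F/W)(S/C/T)xKE(A/S)xxK,
# where x stands for any residue.
DOMAIN_I = re.compile(r"[VI]G[VI]D")
DOMAIN_II = re.compile(r"[FW][SCT].KE[AS]..K")
# Full signature, with 40-45 residues between the two halves.
SIGNATURE = re.compile(r"[VI]G[VI]D.{40,45}[FW][SCT].KE[AS]..K")


def pptase_motifs(protein: str) -> dict:
    """Report which parts of the PPTase signature occur in a protein sequence."""
    seq = protein.upper()
    return {
        "domain_I": DOMAIN_I.search(seq) is not None,
        "domain_II": DOMAIN_II.search(seq) is not None,
        "full_signature": SIGNATURE.search(seq) is not None,
    }


# XabA segments as listed in Table 4; the spacer content is a placeholder (assumption).
xaba_like = "GVGIDLERP" + "A" * 39 + "FSAKESLFK"
print(pptase_motifs(xaba_like))
```

A pattern match of this kind only confirms that the motifs are present. It does not by itself predict carrier-protein specificity, which is consistent with the caution expressed above.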

Complementation studies have revealed substantial functional interchangeability 
of PPTases in different bacteria. For example, the B. subtilis sfp gene involved in surfactin 
biosynthesis complements mutants in E. coli entD (enterobactin biosynthesis) and B. brevis 
gsp (gramicidin biosynthesis) (Borchert et al., 1994; Grossman et al., 1993). In vitro, 
ACPS from E. coli activates apoproteins from Lactobacillus, Rhizobium and Streptomyces 
(Lambalot et al., 1996). Because XabA shows highest similarity to EntD, we amplified the 
entD-coding region from E. coli, and arranged it for expression from the lac promoter in 
broad host-range vector pLAFR3. This construct (pLENTD) restored albicidin production 
in X. albilineans xabA::Tn5 mutant LS156. EntD is a peptide-selective PPTase that 
converts inactive apo-EntF and apo-EntB to active holo-enzymes involved in biosynthesis 
of enterobactin in E. coli (Gehring et al., 1997a). Functional complementation of the 
xabA::Tn5 mutation by entD indicates that XabA is a PPTase required for 
post-translational activation of synthetases involved in albicidin production in X. albilineans. 
The specificity of EntD for activation of peptide synthetases in E. coli indicates that 
albicidin biosynthesis probably involves an XabA-activated peptide synthetase. 

Some PPTase genes involved in non-ribosomally synthesised peptide biogenesis 
are located near the genes encoding their targets (Quadri et al., 1998b). For example, B. 
brevis gsp, B. subtilis sfp, and E. coli entD genes all lie within 4 kb of operons encoding 
the target peptide synthetases (Borchert et al., 1994; Coderre & Earhart, 1989; Nakano et 
al., 1992). However, M. tuberculosis pptT is not located near the mbt gene cluster encoding 
the target peptide synthetases involved in mycobactin biosynthesis (Quadri et al., 1998b). 
No gene encoding a PPTase has been identified in any of the antibiotic and phytotoxin 
biosynthetic gene clusters characterised from Streptomyces spp. (Gehring et al., 1997b) 
and Pseudomonas spp. (Bender et al., 1999). No evident target gene was found within 
1282 bp upstream or 870 bp downstream of xabA. Three cosmids spanning about 100 kb in 
two regions of the genome complemented 56 of 58 tested Tox⁻ mutants of X. albilineans, 
but not LS156 (Rott et al., 1996). These results indicate that xabA is not clustered with the 
genes encoding the antibiotic synthetases that it activates. 

Expression of xabA (or an alternative PPTase such as entD) is essential for 
albicidin biosynthesis. The phosphopantetheinyl transferase gene described herein provides 
new insight into antibiotic biosynthesis in the Pseudomonadaceae, and new opportunities 
to understand and apply albicidins as potent inhibitors of prokaryote DNA replication. This 
gene, together with xabB, provides the means to engineer high level co-expression of the 
albicidin synthetase and its activating PPTase to obtain higher yields of albicidins, and 
ultimately to manipulate the elements of this biosynthetic machinery, by mutagenesis or 
otherwise, to produce desired structural variants of this novel antibiotic class. These genes 
may also indicate a new approach to disease resistance, by engineering plants to interfere 
with the biosynthesis of albicidin toxins, which are key pathogenesis factors for the 
systemic development of leaf scald disease. 

EXAMPLE 3 

A methyltransferase gene is involved in albicidin biosynthesis in Xanthomonas albilineans 

Materials and Methods 

Bacterial strains and plasmids 

The properties of bacteria and plasmids used in this example are listed in Table 5. 

Media, culture conditions and antibiotics 

X. albilineans strains were routinely cultured on sucrose peptone (SP) medium at 
28°C (Birch and Patil, 1985b). Escherichia coli strains were used as hosts in cloning 
experiments and were grown on LB medium at 37°C (Sambrook et al., 1989). Broth 
cultures were aerated by shaking at 200 rpm on an orbital shaker. Modified YEB medium 
(Van Larebeke et al., 1977) was used for patch mating. The following antibiotics were 
added to media as required: kanamycin, 50 µg/mL; tetracycline, 15 µg/mL; ampicillin, 100 
µg/mL. 

Assay of albicidin production 

Albicidin was quantified by a microbial plate bioassay as described previously 
(Birch and Patil, 1985b), except that the 10 mL basal layer of LB agar and the 5 mL 
overlayer of 50% LB with 1% agar were supplemented with tetracycline, and E. coli 
DH5α [pLAFR3] was used as the indicator strain. This change avoided interference by 
tetracycline, which was added to some cultures to ensure retention of pLAFR3 derivatives 
in X. albilineans. 

Routine genetic procedures 

Bacterial genomic DNA and plasmid DNA isolation, gel electrophoresis, DNA 
restriction digests, ligation reactions and transformation were performed by routine 
procedures (Sambrook et al., 1989). DNA fragments were excised from agarose gels and 
residual agarose was removed with the BRESAclean™ DNA purification kit (GeneWorks, 
Adelaide). 

DNA sequencing and analysis 

Sequencing reactions were performed by dideoxynucleotide chain termination 
(Sanger et al., 1977) using the BigDye™ Terminator Cycle Sequencing Kit and 373A 
DNA sequencer (PE Applied Biosystems) through the Australian Genome Research 
Facility. Oligonucleotide primers were purchased from GeneWorks (Adelaide). University 
of Wisconsin Genetics Computer Group (UWGCG) programs BLASTP, FASTA, PILEUP, 
and BESTFIT were used through WebANGIS version 2.0 for DNA and protein sequence 
analyses of the GenBank, EMBL, PIR and SWISSPROT databases. 



WO 02/24736 



PCT/AU01/01190 



-88- 

Recovery of the downstream sequence of truncated xabC by IPCR 

Genomic DNA of X. albilineans LS155 was digested with NcoI. Following 
phenol/chloroform extraction and ethanol precipitation, the digested DNA was self-ligated 
at a concentration of 0.5 µg/mL. The ligated DNA was precipitated with ethanol and 
resuspended in sterile H₂O to a concentration of 20 ng/µL as template for IPCR. Sequence 
of the 16.5 kb EcoRI fragment including the 5' region of xabC was used to design primers 
(IF: 5'-AAGCGTCGACATAGCAGCAG-3' and IR: 5'-CGGCAACGCATTCGACCTCG-3') 
for IPCR-amplification of the sequence downstream of the EcoRI site of the truncated 
xabC gene. 

IPCR was performed in a volume of 50 µL with 50 ng of template DNA, 0.4 
ng/µL of each primer, 0.2 mM of each dNTP, 1.8 mM Mg²⁺, and 1 unit of elongase 
enzyme mix with proof-reading activity (Life Technologies). A 10 min initial denaturation 
step at 94°C was followed by 35 thermal cycles of denaturation at 94°C for 1 min, 
annealing at 55°C for 1 min, and extension at 72°C for 1 min per 1 kb of expected 
amplification product. The IPCR product was cloned into pZErO-2 to give pZEXC. Clones 
of construct pZDCC from three independent PCR reactions were sequenced to rule out the 
possibility of PCR-generated errors. 
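
The reaction set-up and cycling parameters above reduce to simple arithmetic. The sketch below is a hedged illustration only: the function name ipcr_setup is hypothetical, the 1 kb product size is taken from the description of the IPCR product in Table 5, and the time estimate ignores ramp times and any final extension step, neither of which is stated in the text.

```python
def ipcr_setup(product_kb, template_ng=50.0, template_stock_ng_per_ul=20.0,
               cycles=35, initial_denat_min=10.0, denat_min=1.0,
               anneal_min=1.0, ext_min_per_kb=1.0):
    """Return (template volume in uL, nominal cycling time in min) for the IPCR.

    Default values are those stated in the text; product_kb is the expected
    size of the amplification product.
    """
    # Volume of the 20 ng/uL self-ligated template needed to deliver 50 ng.
    template_ul = template_ng / template_stock_ng_per_ul
    # Extension time scales with expected product length (1 min per kb).
    extension_min = ext_min_per_kb * product_kb
    total_min = initial_denat_min + cycles * (denat_min + anneal_min + extension_min)
    return template_ul, total_min


template_ul, total_min = ipcr_setup(product_kb=1.0)
print(f"template: {template_ul} uL, nominal cycling time: {total_min} min")
```

For a 1 kb product this corresponds to 2.5 µL of template per 50 µL reaction and a nominal 115 min of programmed hold time.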



Insertional mutagenesis 

An internal 625 bp ClaI-EcoRI fragment of xabC (Figure 13) was first cloned 
into ClaI/EcoRI-digested pBluescript II SK to provide a KpnI restriction site, then 
subcloned into EcoRI/KpnI-cleaved pJP5603 to yield pJP-BEC. The inserts in pBluescript 
II SK intermediates (pBEC) were sequenced to confirm the expected clones. 

The suicide construct pJP-BEC was transferred from the mobilising strain E. coli 
S17-1 (λpir) into X. albilineans LS155. Exconjugant colonies were selected on SP agar 
containing kanamycin and ampicillin. Insertional disruption in xabC or thp was verified by 
PCR using primers flanking the expected integration site of pJP-BEC or pJP-BAS and 
extension at 72°C for 1 min as previously described (Zhang and Birch, 1997b). The effect 
on albicidin biosynthesis was determined using the microbial plate assay. Representative 
(Tox⁻) insertional mutants in thp (LS-JP1) and xabC (LS-JP2) were retained for further 
analysis. 

Construction of expression vectors 

The coding region of the xabC gene was amplified from X. albilineans LS155 
chromosomal DNA by PCR. Primer A3F (5'-CGGGATCCCATGGATTCAGCGTTACC-3') 
contained a BamHI restriction site (GGATCC) for insertion of the amplified gene into 
the correct reading frame of lacZ in pLAFR3. Primer A3R 
(5'-CCCAAGCTTTCATTATGGGGCCCTCTTGC-3') introduced a HindIII restriction site 
(AAGCTT). The amplified DNA was digested with BamHI and HindIII, and ligated with 
BamHI/HindIII-digested pLAFR3 to result in pLXABC. X. albilineans Tox⁻ mutant LS157 
contains a single Tn5 insertion, in a 4.1 kb ClaI restriction fragment or a 16.5 kb EcoRI 
restriction fragment (Figure 12). Selection for kanamycin resistance, following shotgun 
cloning of ClaI restriction fragments of LS157 DNA into pBluescript II SK, yielded clone 
pBC157. Sequences flanking the Tn5 insertion in LS157 DNA were amplified by inverse 
PCR, and cloned into pZErO-2, producing pZIL and pZIR. The double-strand sequence of 
the 16,511 bp EcoRI genomic fragment in pLXABB was obtained by a primer-walking 
approach, using subclones pBC157, pZIL, pZIR, pSEBL, and pSEBR. The Tn5 insertion in 
the genome of LS157 is accompanied by a 9-bp perfect repeat sequence (GTCCTGAAG), 
commencing at 2490 bp in GenBank accession no. AF239749. 
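
The restriction sites engineered into primers A3F and A3R can be confirmed by a simple string search. The sketch below is illustrative only: the helper name find_sites is hypothetical, the primer sequences are as transcribed in this example with spacing removed, and the recognition sequences used are the standard ones for BamHI (GGATCC) and HindIII (AAGCTT).

```python
# Standard recognition sequences for the two enzymes named in the text.
SITES = {"BamHI": "GGATCC", "HindIII": "AAGCTT"}

# Primer sequences (5' to 3') as transcribed in this example.
PRIMERS = {
    "A3F": "CGGGATCCCATGGATTCAGCGTTACC",
    "A3R": "CCCAAGCTTTCATTATGGGGCCCTCTTGC",
}


def find_sites(primer, sites=SITES):
    """Map enzyme name to the 0-based start index of its site, for sites present."""
    primer = primer.upper()
    return {name: primer.find(site) for name, site in sites.items() if site in primer}


for name, seq in PRIMERS.items():
    print(name, len(seq), "nt", find_sites(seq))
```

Each primer carries only its intended site near the 5' end, preceded by a few extra bases, which is the usual arrangement for directional cloning of a PCR product.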

Genetic complementation of albicidin biosynthesis 

DNA transfer between E. coli donor (JM109 pLAFR3 ± insert) and X. albilineans 
recipient (LS-JP1 or LS-JP2) was accomplished by triparental transconjugation with 
helper strain pRK2013. Mid-log-phase cultures of the recipient were spotted onto agar 
plates containing YEB medium with no antibiotics (20 µL per spot). After the liquid was 
absorbed by the agar, 20 µL of mid-log-phase culture of the helper was added to each spot. 
The liquid was again allowed to absorb, and 20 µL of mid-log-phase culture of the donor 
was added to each spot. After incubation of the mating plates overnight at 28°C, 
transconjugants were selected on SP plates supplemented with ampicillin, and tetracycline 
or spectinomycin. 

Transconjugants were tested for albicidin production using the microbial plate 
bioassay. The constructs pLXABB and pLXABC were designed to test complementation in 
trans. However, complementation could also occur in cis, by homologous recombination 
between the complementing construct and the insertionally mutated chromosomal gene. To 
exclude this possibility, the retention of the insertion in xabC was confirmed by PCR, 
using primers from aphA (in the insertion) and xabB (adjoining xabC in the chromosome). 



Results and Discussion 

Cloning and sequencing of the full-length xabC gene 

Downstream by 45 bp from the TAG stop codon of xabB is the start of an ORF 
(designated xabC) in the same orientation. The 639-bp sequence downstream of the EcoRI 
site of the truncated xabC was amplified from wt X. albilineans LS155 using IPCR. The 
double-strand nucleotide sequence of 1515 bp from the stop codon of xabB to the NcoI site 
downstream of xabC (Figure 13) is deposited in GenBank under accession no. AF239750. 
The xabC ORF encodes a protein of 343 aa (Mr 37,704). One TCTG-like sequence 
(TGTG) and one typical TCTG box, characteristic of many factor-independent termination 
sites (Brendel and Trifonov, 1984), occur downstream of the termination codon (TAA) of 
xabC (Fig. 2). However, the other features typical of such terminators (a region of GC-rich 
dyad symmetry, followed by a run of consecutive thymine residues) are not present within 
435 bp downstream of the xabC stop codon. 
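
The coordinates given above can be cross-checked arithmetically. The sketch below is an illustrative bookkeeping exercise only, and rests on two labelled assumptions: that the 1515 bp deposited sequence includes the 3-bp xabB stop codon, and that the 45 bp are counted between that stop codon and the xabC start codon. The function name downstream_bp is hypothetical.

```python
REGION_BP = 1515      # xabB stop codon to the NcoI site, as stated in the text
XABB_STOP_BP = 3      # TAG; assumed to be counted within the 1515 bp
SPACER_BP = 45        # between the xabB stop codon and the xabC start (assumed reading)
XABC_CODONS = 343     # residues in the deduced XabC protein
XABC_STOP_BP = 3      # TAA termination codon of xabC


def downstream_bp(region=REGION_BP):
    """Length of sequence remaining after the xabC stop codon, in bp."""
    orf_bp = XABC_CODONS * 3 + XABC_STOP_BP  # coding region plus stop codon
    return region - XABB_STOP_BP - SPACER_BP - orf_bp


print("xabC coding region:", XABC_CODONS * 3, "bp")
print("sequence downstream of the xabC stop codon:", downstream_bp(), "bp")
```

Under these assumptions the coding region is 1029 bp, matching the xabC ORF size listed in Table 5, and 435 bp of sequence lie downstream of the xabC stop codon.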

XabC is similar to O-methyltransferases 

The deduced product of xabC shows 22-30% overall identity and 52-60% overall 
similarity to a family of methyltransferases that utilise S-adenosyl-methionine (SAM) as a 
co-substrate for O-methylation of small molecules (Ingrosso et al., 1989; Haydock et al., 
1991; Kagan and Clarke, 1994). These enzymes include tetracenomycin polyketide C-8 
O-methyltransferase (TcmO, P39896) and C-3 O-methyltransferase (TcmN, P16559) of 
Streptomyces glaucescens, hydroxyneurosporene-O-methyltransferase (P17061) of 
Rhodobacter capsulatus, and hydroxyindole-O-methyltransferases of rat pineal and 
retina (O09179) and chicken pineal gland (Q92056). Three highly conserved motifs in 
SAM-dependent methyltransferases are also present in XabC as shown in Figures 13 and 
14. The crystal structure analysis for the methyltransferase-SAM complex (Schluckebier et 
al., 1995) provides firm structural evidence for the role of motif I in SAM binding. 

Insertional mutagenesis of xabC blocks albicidin biosynthesis 

Insertional mutation in xabC was accomplished using suicide-vector pJP-BEC and 
confirmed by PCR. Six out of eight tested transconjugants were verified by PCR to contain 
insertional mutations in xabC. Albicidin production was undetectable in these insertional 
mutants, compared to the wt X. albilineans LS155 control. The other transconjugants may 
result from integration of the vector at other genomic locations by illegitimate 
recombination as reported previously (Penfold and Pemberton, 1992). 

Complementation test 

Introduction of the xabC gene in pLXABC or the truncated xabC gene in 
pLXABB into insertional mutant LS-JP2 restored albicidin production to the level of the 
wt parental strain LS155. This indicates that xabC is essential for albicidin production in 
X. albilineans. The truncated xabC in pLXABB (SEQ ID NO: 106) encodes 277 residues 
(SEQ ID NO: 107), including all of the three conserved motifs of SAM-methyltransferases, 
and appears fully functional by complementation. The continued presence of an insertion 
in the chromosomal locus was confirmed by PCR. Thus, complementation was operating 
in trans. This also indicates that no other cistron downstream of xabC is required for 
albicidin production, because insertional mutagenesis typically causes polar mutations 
affecting all downstream cistrons in an operon (De Bruijn and Lupski, 1984). 

Enhanced expression of xabC results in increased production of albicidins 
Derivatives of X. albilineans strain LS155, in which an xabC gene, or fragment 
thereof, was introduced in trans, were tested for production of albicidin using the bioassay 
described above. The results, presented in Figure 15, show that expression of xabC cloned 
into pLAFR3 in derivatives of X. albilineans strain LS155 complements an insertional 
mutation in the chromosomal xabC, and also enhances albicidin production early in the 
production phase. Expression of the first part of the xabB operon, including the full-length 
xabB and a truncated but functional xabC, further enhances albicidin production. 




The disclosure of every patent, patent application, and publication cited herein is 
hereby incorporated herein by reference in its entirety. 

The citation of any reference herein should not be construed as an admission that 
such reference is available as "Prior Art" to the instant application. 

Throughout the specification the aim has been to describe the preferred 
embodiments of the invention without limiting the invention to any one embodiment or 
specific collection of features. Those of skill in the art will therefore appreciate that, in 
light of the instant disclosure, various modifications and changes can be made in the 
particular embodiments exemplified without departing from the scope of the present 
invention. All such modifications and changes are intended to be included within the scope 
of the appended claims. 

TABLES 
TABLE 1 

Bacterial strains and plasmids for Example 1 

Strain or plasmid | Relevant characteristics | Reference or source 

Strains 
E. coli 
DH5α | φ80dlacZΔM15, Δ(lacZYA-argF)U169 | Promega 
JM109 | [F', lacIqZΔM15], Δ(lac-proAB) | Promega 
TOP10 | F⁻, Δ(mrr-hsdRMS-mcrBC), Δ(ara-leu)7697, ΔlacX74 | Invitrogen 
X. albilineans 
Xal3 | Wild-type albicidin producer from sugarcane (Queensland), Apʳ | Inventor's laboratory 
LS155 | Wild-type albicidin producer from sugarcane (Queensland), Apʳ | Wall and Birch (1997) 
LS157 | LS155::Tn5, albicidin deficient (Tox⁻), Kmʳ Smʳ Apʳ | Wall and Birch (1997) 

Plasmids 
pBluescript II SK | ColE1 origin, E. coli cloning vector, Apʳ | Stratagene 
pZErO-2 | ColE1 origin, E. coli cloning vector, Kmʳ | Invitrogen 
pRK2013 | ColE1 origin, IncP, Tra⁺, helper plasmid, Kmʳ | Ditta et al. (1980) 
pLAFR3 | RK2 origin, Tra⁻, Mob⁺, broad host-range cosmid, Tcʳ | Staskawicz et al. (1987) 
pRG960sd | ColE1 origin, broad host-range plasmid, contains promoterless uidA with start codon and Shine-Dalgarno sequence, Smʳ Spʳ | Van den Eede et al. (1992) 
pBC157 | 9.9-kb ClaI fragment carrying Tn5 and flanking sequences from LS157, in pBluescript II SK, Kmʳ Apʳ | This study 
pZIL | 1.4-kb fragment, inverse PCR amplified from LS157 in pZErO-2, Kmʳ | This study 
pZIR | 6.0-kb fragment, inverse PCR amplified from LS157 in pZErO-2, Kmʳ | This study 
pZTI | 0.9-kb fragment, PCR amplified from LS157 in pZErO-2, Kmʳ | This study 
pXABB | 16.5-kb EcoRI fragment from Xal3 in pBluescript II SK, Apʳ | This study 
pSEBL | 7.9-kb EcoRI-SpeI fragment from pXABB in pBluescript II SK, Apʳ | This study 
pSEBR | 8.6-kb EcoRI-SpeI fragment from pXABB in pBluescript II SK, Apʳ | This study 
pLXABB1 | 16.5-kb EcoRI fragment from pXABB in pLAFR3 (xabB in the same direction as lac), Tcʳ | This study 
pLXABB2 | 16.5-kb EcoRI fragment from pXABB in pLAFR3 (xabB in the opposite direction to lac), Tcʳ | This study 
pRG960p1 | 206-bp BamHI-XmaI fragment in pRG960sd, Smʳ Spʳ | This study 
pRG960p2 | 690-bp BamHI-XmaI fragment in pRG960sd, Smʳ Spʳ | This study 

TABLE 2 

Comparison of conserved sequences in peptide synthetases and XabB 

Domain | Motif | Sequence conserved in peptide synthetases (a) | Sequence in XabB | Position in XabB (aa) 

Adenylation | A1 | L(T/S)YxEL | WSYAQL | 3806-3811 
 | A2 | LKAGxAYL(V/L)P(L/I)D | FKAGACYVPID | [illegible] 
 | A3 | LAYxxYTSG(S/T)TGxPKG | LACVMVTSGSTGRP[illegible] | [illegible] 
 | A4 | FDxS | FAVS | 3967-3970 
 | A5 | NxYGPTE | NNYGCTE | 4063-4069 
 | A6 | GELxIxGxG(V/L)ARGYL | GELHVHSVGMARGYW | 4114-4128 
 | A7 | Y(R/K)TGDL | YKTGDM | 4152-4157 
 | A8 | GRxDxQVKIRGxRIELGEIE | GRQDFEVKVRGHRVDTRQVE | 4170-4189 
 | A9 | LPxYM(I/V)P | LPTYMLP | 4239-4245 
 | A10 | NGK(V/L)DR | NGKLDR | 4259-4264 
Peptidyl carrier protein | PCP | DxFFxLGG(H/D)S(L/I) | DNFFALGGHSL; MDFFAVGGHSV | 4306-4316; 3261-3271 
Condensation | C1 | SxAQxR(L/M)(W/Y)xL | TYAQERLWLV; SLFQERLWFV | 3333-3342; 4374-4383 
 | C2 | RHExLRTxF | RHEVLRTRF; RHEILRTRF | 3381-3389; 4421-4429 
 | C3 | MHHxISDG(W/V)S | IHHIISDGWS; MHHLIYDAWS | 3456-3465; 4498-4507 
 | C4 | YxD(F/Y)AVW | YADYALW; YADYAIW | 3495-3501; 4538-4544 
 | C5 | (I/V)GxFVNT(Q/L)(C/A)xR | IGFFINILPLR; IGFFINILPLR | 3606-3617; 4649-4659 
 | C6 | (H/N)QD(Y/V)PFE | HQSVPFE; NQALPFE | 3641-3647; 4685-4691 
 | C7 | RDxSRNPL | RDSSQIPL; RDTSRIPL | 3658-3665; 4701-4708 

(a) Sourced from reference (Marahiel et al., 1997). 

TABLE 3 



Bacterial strains and plasmids for Example 2 

Strain or plasmid | Relevant characteristics | Reference or source 

Strains 
E. coli 
DH5α | φ80dlacZΔM15, recA1, endA1, gyrA96, thi-1, hsdR17(rK⁻, mK⁺), supE44, relA1, deoR, Δ(lacZYA-argF)U169 | Promega 
JM109 | [F', traD36, proAB, lacIqZΔM15], recA1, endA1, gyrA96, thi, hsdR17(rK⁻, mK⁺), supE44, relA1, Δ(lac-proAB) | Promega 
X. albilineans 
Xal3 | Wild-type albicidin producer from sugarcane (Queensland), Apʳ | This laboratory 
LS155 | Wild-type albicidin producer from sugarcane (Queensland), Apʳ | Wall & Birch (1997) 
LS156 | LS155::Tn5, albicidin deficient (Tox⁻), Kmʳ Smʳ Apʳ | Wall & Birch (1997) 

Plasmids 
pBluescript II SK | ColE1 origin, E. coli cloning vector, Apʳ | Stratagene 
pGEM-T | ColE1 origin, E. coli TA-cloning vector, Apʳ | Promega 
pRK2013 | ColE1 origin, IncP, Tra⁺, helper plasmid, Kmʳ | Ditta et al. (1980) 
pLAFR3 | RK2 origin, Tra⁻, Mob⁺, broad host-range cosmid, Tcʳ | Staskawicz et al. (1987) 
pBEA1 | 8.8-kb EcoRI fragment carrying Tn5 and flanking sequences from LS156, in pBluescript II SK, Kmʳ Apʳ | This study 
pCEA1 | 1766-bp EcoRI-ClaI fragment from pBEA1 in pBluescript II SK, Apʳ | This study 
pPEA1 | 697-bp EcoRI-PstI fragment from pBEA1 in pBluescript II SK, Apʳ | This study 
pGTA1 | 2.1-kb fragment, PCR amplified from LS155 in pGEM-T, Apʳ | This study 
pLXABA | 834-bp EcoRI-BamHI fragment (xabA ORF) from pGTA1 in pLAFR3, Tcʳ | This study 
pLENTD | 630-bp EcoRI-HindIII fragment (entD ORF) from DH5α in pLAFR3, Tcʳ | This study 

TABLE 4 

Similarity between XabA and other PPTases involved in antibiotic and fatty acid 
biosynthesis in bacteria 

Pathway | Protein | Organism | Specificity (A/P)† | Domain I | | Domain II | Homology (ID/SIM) 

Albicidin | XabA | X. albilineans | ? | GVGIDLERP | (X)39 | FSAKESLFKAJY | - 
Enterobactin | EntD | E. coli | P† | PIGIDIEEI | (X)35 | FSAKESAFKASE | 35.9/56.5 
 | | S. flexneri | ? | PIGVDIEEI | (X)36 | FSAKESAFKAS | 35.0/55.5 
 | | S. typhimurium | ? | RIGIDIEKI | (X)35 | FSAKESVYKAFQ | 35.9/62.0 
 | | S. austin | ? | RVGVDIEKI | (X)35 | FSAKESVYKAFQ | 36.1/61.1 
Mycobactin | PptT | M. tuberculosis | P | SVGIDAEPH | (X)34 | FCAKEATYKAWF | 30.5/55.5 
Surfactin | Sfp | B. subtilis | A/P* | FIGIDIEKT | (X)35 | WSMKESFIKQEG | 24.8/48.5 
 | Psf-1 | B. pumilus | ? | FVGIDIEEI | (X)35 | WSMKEAFIKLTG | 19.8/47.6 
Gramicidin | Gsp | B. brevis | P† | PVGIDIERI | (X)35 | WTIKESYIKAIG | 20.8/42.0 
Iturin A | Lpa-14 | B. subtilis | ? | PIGIDIEKM | (X)35 | WSMKESFIKQAG | 20.0/43.4 
Fatty acids | HI0152 | H. influenzae | ? | AVGIDIEFP | (X)34 | WCLREAVLKSQG | 19.7/45.7 
 | AcpS | E. coli | A* | GLGTDIVEI | (X)40 | FAVKEAAAKAFG | 16.5/38.8 
 | | M. tuberculosis | A | GVGIDLVSI | (X)41 | WAAKEAVIKAWS | 25.7/47.6 
 | | B. subtilis | ? | GIGLDITEL | (X)41 | FAAKEAFSKAFG | 25.5/46.2 
PPTase domain * | | | | (V/I)G(I/V)D | | (F/W)(S/C/T)xKE(S/A)xxK | 

TABLE 5 

Bacterial strains and plasmids for Example 3 

Strain or plasmid | Relevant characteristics | Reference or source 

Strains 
E. coli 
DH5α | φ80dlacZΔM15, Δ(lacZYA-argF)U169 | Promega 
JM109 | [F', lacIqZΔM15], Δ(lac-proAB) | Promega 
TOP10 | F⁻, Δ(mrr-hsdRMS-mcrBC), Δ(ara-leu)7697, ΔlacX74 | Invitrogen 
S17-1 λpir | S17-1 lysogenized with λpir | Penfold and Pemberton (1992) 
X. albilineans 
Xal3 | wt albicidin producer from sugarcane (Queensland), Apʳ | Our laboratory 
LS155 | wt albicidin producer from sugarcane (Queensland), Apʳ | Wall and Birch (1997) 
LS157 | xabB::Tn5, albicidin deficient (Tox⁻), Kmʳ Smʳ Apʳ | Wall and Birch (1997) 
LS-JP1 | thp::pJP-BAS, albicidin deficient (Tox⁻), Kmʳ Apʳ | This work 
LS-JP2 | xabC::pJP-BEC, albicidin deficient (Tox⁻), Kmʳ Apʳ | This work 

Plasmids 
pBluescript II SK | ColE1 origin, E. coli cloning vector, Apʳ | Stratagene 
pZErO-2 | ColE1 origin, E. coli cloning vector, Kmʳ | Invitrogen 
pRK2013 | ColE1 origin, IncP, Tra⁺, helper plasmid, Kmʳ | Ditta et al. (1980) 
pLAFR3 | RK2 origin, Tra⁻, Mob⁺, broad host-range cosmid, Tcʳ | Staskawicz et al. (1987) 
pJP5603 | Bacterial suicide vector, Kmʳ | Penfold and Pemberton (1992) 
pZDCC | 1 kb IPCR product in pZErO-2, Kmʳ | This work 
pBAS | 278 bp ApaI-SalI fragment of thp in pBluescript II SK, Apʳ | This work 
pJP-BAS | [illegible] fragment from pBAS in pJP5603, Kmʳ | This work 
pBEC | 625 bp ClaI-EcoRI fragment of xabC in pBluescript II SK, Apʳ | This work 
pJP-BEC | 655 bp EcoRI-KpnI fragment from pBEC in pJP5603, Kmʳ | This work 
pLTHP | 1226 bp EcoRI-BamHI fragment from pLXABB in pLAFR3, Tcʳ | This work 
pLXABC | 1029 bp xabC ORF amplified from LS155 in pLAFR3, Tcʳ | This work 
pLXABB | 16.5 kb EcoRI fragment from Xal3 in pLAFR3, Tcʳ | This work 

CLAIMS 

1. An isolated polypeptide comprising at least one domain selected from the group 
consisting of: 

(a) an acyl-CoA ligase (AL) domain comprising a sequence set forth in any one or 
more of SEQ ID NO: 6 and 8, or variants thereof; 

(b) a β-ketoacyl synthase (KS) domain comprising a sequence set forth in any one or 
more of SEQ ID NO: 10, 12, 14, 16, 18 and 20, or variants thereof; 

(c) a β-ketoacyl reductase (KR) domain comprising the sequence set forth in SEQ ID 
NO: 22, or variants thereof; 

(d) an acyl carrier protein (ACP) domain comprising a sequence set forth in any one 
or more of SEQ ID NO: 24, 26 and 28, or variants thereof; 

(e) an adenylation (A) domain comprising a sequence set forth in any one or more of 
SEQ ID NO: 30, 32, 34, 36, 38, 40, 42, 44, 46 and 48, or variants thereof; 

(f) a peptidyl carrier protein (PCP) domain comprising a sequence set forth in any 
one or more of SEQ ID NO: 50 and 52, and variants thereof; and 

(g) a condensation (C) domain comprising a sequence set forth in any one or more of 
SEQ ID NO: 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78 and 80, or variants 
thereof. 

2. The polypeptide of claim 1, wherein the AL domain comprises each of the sequences 
set forth in SEQ ID NO: 6 and 8, or variants thereof. 

3. The polypeptide of claim 1, wherein the KS domain comprises each of the sequences 
set forth in SEQ ID NO: 10, 12 and 14, or variants thereof. 

4. The polypeptide of claim 1, wherein the KS domain comprises each of the sequences 
set forth in SEQ ID NO: 16, 18 and 20, or variants thereof. 

5. The polypeptide of claim 1, wherein the A domain comprises each of the sequences set 
forth in SEQ ID NO: 30, 32, 34, 36, 38, 40, 42, 44, 46 and 48, or variants thereof. 

6. The polypeptide of claim 1, wherein the C domain comprises each of the sequences set 
forth in SEQ ID NO: 54, 56, 58, 60, 62, 64 and 66, or variants thereof. 

7. The polypeptide of claim 1, wherein the C domain comprises each of the sequences set 
forth in SEQ ID NO: 68, 70, 72, 74, 76, 78 and 80, or variants thereof. 

8. The polypeptide of claim 1, wherein the domains are arranged in an N- to C-terminal 
direction as follows: AL-ACP-KS-KR-ACP-ACP-KS-PCP-C-A-PCP-C. 

9. The polypeptide of claim 1, comprising the sequence set forth in SEQ ID NO: 2, or 
biologically active fragment thereof, or variant or derivative of these. 

10. The polypeptide of claim 9, wherein the variant has at least 60% sequence identity to 
the sequence set forth in SEQ ID NO: 2. 

11. The polypeptide of claim 9, wherein the biologically active fragment is at least 6 amino 
acids in length. 

12. An isolated polypeptide comprising at least a biologically active fragment of the 
sequence set forth in SEQ ID NO: 2 or variant or derivative thereof. 

13. The polypeptide of claim 12, wherein the biologically active fragment is at least 6 
amino acids in length. 

14. The polypeptide of claim 12, wherein the biologically active fragment comprises at 
least one domain selected from the group consisting of: 

(a) an acyl-CoA ligase (AL) domain comprising a sequence set forth in any one or 
more of SEQ ID NO: 6 and 8, or variants thereof; 

(b) a β-ketoacyl synthase (KS) domain comprising a sequence set forth in any one or 
more of SEQ ID NO: 10, 12, 14, 16, 18 and 20, or variants thereof; 

(c) a β-ketoacyl reductase (KR) domain comprising the sequence set forth in SEQ ID 
NO: 22, or variants thereof; 

(d) an acyl carrier protein (ACP) domain comprising a sequence set forth in any one 
or more of SEQ ID NO: 24, 26 and 28, or variants thereof; 

(e) an adenylation (A) domain comprising a sequence set forth in any one or more of 
SEQ ID NO: 30, 32, 34, 36, 38, 40, 42, 44, 46 and 48, or variants thereof; 

(f) a peptidyl carrier protein (PCP) domain comprising a sequence set forth in any 
one or more of SEQ ID NO: 50 and 52, and variants thereof; and 

(g) a condensation (C) domain comprising a sequence set forth in any one or more of 
SEQ ID NO: 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78 and 80, or variants 
thereof. 

15. The polypeptide of claim 13, wherein the AL domain comprises each of the sequences 
set forth in SEQ ID NO: 6 and 8, or variants thereof. 

16. The polypeptide of claim 13, wherein the KS domain comprises each of the sequences 
set forth in SEQ ID NO: 10, 12 and 14, or variants thereof. 

5 17. The polypeptide of claim 13, wherein the KS domain comprises each of the sequences 
set forth in SEQ ID NO: 16, 18 and 20, or variants thereof. 

18 The polypeptide of claim 13, wherein the A domain comprises each of the sequences 
set forth in SEQ ID NO: 30, 32, 34, 36, 38, 40, 42, 44, 46 and 48, or variants thereof. 

19. The polypeptide of claim 13, wherein the C domain comprises each of the sequences 
10 set forth in SEQ ID NO: 54, 56, 58, 60, 62, 64 and 66, or variants thereof. 

20. The polypeptide of claim 13, wherein the C domain comprises each of the sequences 
set forth in SEQ ID NO: 68, 70, 72, 74, 76, 78 and 80, or variants thereof. 

21. The polypeptide of claim 12, wherein the variant has at least 60% sequence identity to 
said at least a biologically active fragment. 

22. The polypeptide of claim 12, wherein the variant has at least 70% sequence identity to 
any one of the amino acid sequences set forth in SEQ ID NO: 6, 8, 10, 12, 14, 16, 18, 20, 
22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 
70, 72, 74, 76, 78 or 80. 

23. An isolated polypeptide comprising at least a biologically active fragment of the 
sequence set forth in SEQ ID NO: 83, or a variant or derivative thereof. 

24. The polypeptide of claim 23, wherein the biologically active fragment comprises at 
least one of the consensus PPTase sequence motifs set forth in SEQ ID NO: 89 or 93, or 
variant thereof. 

25. The polypeptide of claim 24, wherein the biologically active fragment comprises both 
the consensus PPTase sequence motifs set forth in SEQ ID NO: 89 and 93, or variant 
thereof. 
26. The polypeptide of claim 23, wherein the biologically active fragment comprises the 
intervening sequence between said consensus PPTase sequence motifs, which intervening 
sequence comprises the sequence set forth in SEQ ID NO: 91, or variant thereof. 

27. The polypeptide of claim 23, wherein the biologically active fragment comprises a 
contiguous sequence of amino acids contained within the sequence set forth in SEQ ID 
NO: 87, or variant thereof. 

28. The polypeptide of claim 23, wherein the biologically active fragment is at least 6 
amino acids in length. 

29. The polypeptide of claim 23, wherein the variant has at least 60% sequence identity to 
the sequence set forth in SEQ ID NO: 83. 

30. The polypeptide of claim 23, wherein the variant has at least 70% sequence identity to 
any one of the amino acid sequences set forth in SEQ ID NO: 87, 89, 91 or 93. 

31. An isolated polypeptide comprising at least a biologically active fragment of the 
sequence set forth in SEQ ID NO: 95, or a variant or derivative thereof. 

32. The polypeptide of claim 31, wherein the biologically active fragment comprises at 
least one of the consensus methyltransferase sequence motifs set forth in SEQ ID NO: 99, 
101 or 103, or variant thereof. 

33. The polypeptide of claim 31, wherein the biologically active fragment comprises all the 
consensus methyltransferase sequence motifs set forth in SEQ ID NO: 99, 101 and 103, or 
variant thereof. 

34. The polypeptide of claim 31, wherein the biologically active fragment comprises a 
contiguous sequence of amino acids contained within the sequence set forth in SEQ ID 
NO: 105, or variant thereof. 

35. The polypeptide of claim 31, wherein the biologically active fragment comprises a 
contiguous sequence of amino acids contained within the sequence set forth in SEQ ID 
NO: 107, or variant thereof. 

36. The polypeptide of claim 31, wherein the biologically active fragment is at least 6 
amino acids in length. 
37. The polypeptide of claim 31, wherein the variant has at least 60% sequence identity to 
the sequence set forth in SEQ ID NO: 95. 

38. The polypeptide of claim 31, wherein the variant has at least 70% sequence identity to 
any one of the amino acid sequences set forth in SEQ ID NO: 99, 101 or 103. 

39. An isolated polynucleotide comprising a sequence encoding at least one domain 
selected from the group consisting of: 

(a) an acyl-CoA ligase (AL) domain comprising a sequence set forth in any one or 
more of SEQ ID NO: 6 and 8, or variants thereof; 

(b) a β-ketoacyl synthase (KS) domain comprising a sequence set forth in any one or 
more of SEQ ID NO: 10, 12, 14, 16, 18 and 20, or variants thereof; 

(c) a β-ketoacyl reductase (KR) domain comprising the sequence set forth in SEQ ID 
NO: 22, or variants thereof; 

(d) an acyl carrier protein (ACP) domain comprising a sequence set forth in any one 
or more of SEQ ID NO: 24, 26 and 28, or variants thereof; 

(e) an adenylation (A) domain comprising a sequence set forth in any one or more of 
SEQ ID NO: 30, 32, 34, 36, 38, 40, 42, 44, 46 and 48, or variants thereof; 

(f) a peptidyl carrier protein (PCP) domain comprising a sequence set forth in any 
one or more of SEQ ID NO: 50 and 52, and variants thereof; and 

(g) a condensation (C) domain comprising a sequence set forth in any one or more of 
SEQ ID NO: 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78 and 80, or variants 
thereof. 

40. The polynucleotide of claim 39, wherein the AL domain is encoded by a nucleotide 
sequence set forth in any one or more of SEQ ID NO: 5 or 7, or variants thereof. 

41. The polynucleotide of claim 40, wherein the AL domain is encoded by a nucleotide 
sequence comprising each of the sequences set forth in SEQ ID NO: 5 and 7, or variants 
thereof. 

42. The polynucleotide of claim 39, wherein the KS domain is encoded by a nucleotide 
sequence set forth in any one or more of SEQ ID NO: 9, 11, 13, 15, 17 and 19, or variants 
thereof. 
43. The polynucleotide of claim 42, wherein the KS domain is encoded by a nucleotide 
sequence comprising each of the sequences set forth in SEQ ID NO: 9, 11 and 13, or 
variants thereof. 

44. The polynucleotide of claim 42, wherein the KS domain is encoded by a nucleotide 
sequence comprising each of the sequences set forth in SEQ ID NO: 15, 17 and 19, or 
variants thereof. 

45. The polynucleotide of claim 39, wherein the KR domain is encoded by a nucleotide 
sequence set forth in SEQ ID NO: 21, or variant thereof. 

46. The polynucleotide of claim 39, wherein the ACP domain is encoded by a nucleotide 
sequence set forth in any one or more of SEQ ID NO: 23, 25 and 27, or variants thereof. 

47. The polynucleotide of claim 39, wherein the A domain is encoded by a nucleotide 
sequence set forth in any one or more of SEQ ID NO: 29, 31, 33, 35, 37, 39, 41, 43, 45 and 
47, or variants thereof. 

48. The polynucleotide of claim 47, wherein the A domain is encoded by a nucleotide 
sequence comprising each of the sequences set forth in SEQ ID NO: 29, 31, 33, 35, 37, 39, 
41, 43, 45 and 47, or variants thereof. 

49. The polynucleotide of claim 39, wherein the PCP domain is encoded by a nucleotide 
sequence set forth in any one or more of SEQ ID NO: 49 and 51, or variants thereof. 

50. The polynucleotide of claim 39, wherein the C domain is encoded by a nucleotide 
sequence set forth in any one or more of SEQ ID NO: 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 
73, 75, 77 and 79, or variants thereof. 

51. The polynucleotide of claim 50, wherein the C domain is encoded by a nucleotide 
sequence comprising each of the sequences set forth in SEQ ID NO: 53, 55, 57, 59, 61, 63 
and 65, or variants thereof. 

52. The polynucleotide of claim 50, wherein the C domain is encoded by a nucleotide 
sequence comprising each of the sequences set forth in SEQ ID NO: 67, 69, 71, 73, 75, 77 
and 79, or variants thereof. 
53. The polynucleotide of claim 39, comprising the sequence set forth in any one of SEQ 
ID NO: 1 or 3, or a biologically active fragment thereof at least 18 nucleotides in length, or 
a polynucleotide variant of these. 

54. The polynucleotide of claim 53, wherein the polynucleotide variant has at least 60% 
sequence identity to any one of the polynucleotides set forth in SEQ ID NO: 1 or 3. 

55. The polynucleotide of claim 53, wherein the polynucleotide variant is capable of 
hybridising to any one of the polynucleotides identified by SEQ ID NO: 1 or 3 under at 
least low stringency conditions. 

56. The polynucleotide of claim 39, wherein the polynucleotide variant comprises a 
nucleotide sequence encoding at least one said domain. 

57. The polynucleotide of claim 56, wherein the nucleotide sequence variant has at least 
60% sequence identity to any one or more of the sequences set forth in SEQ ID NO: 5, 7, 
9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 
57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77 and 79. 

58. The polynucleotide of claim 56, wherein the nucleotide sequence variant is capable of 
hybridising to any one of the sequences identified by SEQ ID NO: 5, 7, 9, 11, 13, 15, 17, 
19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 
67, 69, 71, 73, 75, 77 and 79 under at least low stringency conditions. 

59. An isolated polynucleotide comprising a sequence encoding at least a biologically active 
fragment of the sequence set forth in SEQ ID NO: 83, or a variant or derivative thereof. 

60. The polynucleotide of claim 59, comprising the sequence set forth in any one of SEQ 
ID NO: 82 and 84, or a biologically active fragment thereof, or a polynucleotide variant of 
these. 

61. The polynucleotide of claim 59, comprising a contiguous sequence of nucleotides at 
least 18 nucleotides in length and contained within the sequence set forth in SEQ ID NO: 
86, or variant thereof. 

62. The polynucleotide of claim 59, wherein the polynucleotide variant has at least 60% 
sequence identity to any one of the polynucleotides set forth in SEQ ID NO: 82, 84 and 86. 
63. The polynucleotide of claim 59, wherein the polynucleotide variant is capable of 
hybridising to any one of the polynucleotides identified by SEQ ID NO: 82, 84 and 86 
under at least low stringency conditions. 

64. The polynucleotide of claim 59, wherein the polynucleotide variant comprises a 
nucleotide sequence encoding at least one PPTase sequence motif selected from SEQ ID 
NO: 89 and 93, or variant thereof. 

65. The polynucleotide of claim 64, wherein the polynucleotide variant comprises a 
nucleotide sequence encoding the intervening sequence between the said consensus 
PPTase sequence motifs, said nucleotide sequence comprising the sequence set forth in 
SEQ ID NO: 91. 

66. The polynucleotide of claim 59, wherein the polynucleotide variant suitably comprises 
a nucleotide sequence encoding a contiguous sequence of amino acids contained within the 
sequence set forth in SEQ ID NO: 87, or variant thereof. 

67. The polynucleotide of claim 66, wherein the contiguous sequence is encoded by the 
sequence set forth in SEQ ID NO: 86, or nucleotide sequence variant thereof displaying at 
least 60% identity thereto. 

68. The polynucleotide of claim 64, wherein the PPTase sequence motif is encoded by a 
nucleotide sequence comprising the sequence set forth in any one of SEQ ID NO: 88 and 
92, or nucleotide sequence variant thereof displaying at least 60% identity thereto. 

69. The polynucleotide of claim 65, wherein the intervening sequence is encoded by the 
nucleotide sequence set forth in SEQ ID NO: 90, or nucleotide sequence variant thereof 
displaying at least 60% identity thereto. 

70. The polynucleotide of claim 66, wherein the contiguous sequence is encoded by the 
sequence set forth in SEQ ID NO: 86, or nucleotide sequence variant thereof capable of 
hybridising thereto under at least low stringency conditions. 

71. The polynucleotide of claim 64, wherein the PPTase sequence motif is encoded by a 
nucleotide sequence comprising the sequence set forth in any one of SEQ ID NO: 88 and 
92, or nucleotide sequence variant thereof capable of hybridising thereto under at least low 
stringency conditions. 
72. The polynucleotide of claim 65, wherein the intervening sequence is encoded by the 
nucleotide sequence set forth in SEQ ID NO: 90, or nucleotide sequence variant thereof 
capable of hybridising thereto under at least low stringency conditions. 

73. An isolated polynucleotide comprising a sequence encoding at least a biologically active 
fragment of the sequence set forth in SEQ ID NO: 95, or a variant or derivative thereof. 

74. The polynucleotide of claim 73, comprising the sequence set forth in any one of SEQ 
ID NO: 94 and 96, or a biologically active fragment thereof, or a polynucleotide variant of 
these. 

75. The polynucleotide of claim 73, comprising a contiguous sequence of nucleotides 
contained within the sequence set forth in SEQ ID NO: 104, or variant thereof. 

76. The polynucleotide of claim 73, comprising a contiguous sequence of nucleotides 
contained within the sequence set forth in SEQ ID NO: 106, or variant thereof. 

77. The polynucleotide of claim 73, wherein the polynucleotide variant has at least 60% 
sequence identity to any one of the polynucleotides set forth in SEQ ID NO: 94, 96, 104 
and 106. 

78. The polynucleotide of claim 73, wherein the polynucleotide variant is capable of 
hybridising to any one of the polynucleotides identified by SEQ ID NO: 94, 96, 104 and 
106 under at least low stringency conditions. 

79. The polynucleotide of claim 73, wherein the polynucleotide variant comprises a 
nucleotide sequence encoding a methyltransferase sequence motif selected from any one or 
more of SEQ ID NO: 99, 101 and 103, or variant thereof. 

80. The polynucleotide of claim 79, wherein the methyltransferase sequence motif is 
encoded by a nucleotide sequence comprising the sequence set forth in any one of SEQ ID 
NO: 98, 100 and 102, or nucleotide sequence variant thereof displaying at least 60% 
identity thereto. 

81. The polynucleotide of claim 79, wherein the methyltransferase sequence motif is 
encoded by a nucleotide sequence comprising the sequence set forth in any one of SEQ ID 
NO: 98, 100 and 102, or nucleotide sequence variant thereof capable of hybridising thereto 
under at least low stringency conditions. 
82. An expression vector comprising the polynucleotide of any one of claims 39, 59 or 73, 
wherein the polynucleotide is operably linked to a regulatory polynucleotide. 

83. A host cell containing the expression vector of claim 82. 

84. A multiplicity of cell colonies, constituting a library of colonies, wherein each colony 
of the library contains an expression vector for the production of the polypeptide of claim 1 
or claim 12. 

85. A method for enhancing the level and/or functional activity of an albicidin, said 
method comprising: 

- introducing into an albicidin-producing host cell (1) an agent that modulates 
the expression of a gene encoding at least a portion of the polypeptide of claim 1 or 
variant or derivative thereof, or the level and/or functional activity of an expression 
product of said gene, or (2) a vector from which a polynucleotide encoding at least a 
portion of the polypeptide of claim 1 or variant or derivative thereof can be 
translated; and 

- culturing the host cell for a time and under conditions sufficient to 
enhance the level and/or functional activity of said albicidin. 

86. The method of claim 85, further comprising introducing into said host cell a vector 
from which a PPTase can be translated. 

87. The method of claim 86, wherein the PPTase is selected from EntD or XabA. 

88. The method of claim 85, further comprising introducing into said host cell a vector 
from which a methyltransferase can be translated. 

89. The method of claim 86, wherein the methyltransferase is XabC. 

90. An antigen-binding molecule that is immuno-interactive with the polypeptide of claim 
1 or claim 12. 

91. An antigen-binding molecule that is immuno-interactive with the polypeptide of claim 
23. 

92. An antigen-binding molecule that is immuno-interactive with the polypeptide of claim 
31. 



WO 02/24736 



PCT/AU01/01190 




93. A method of preparing a polynucleotide encoding a modified PKS, comprising using a 
nucleotide sequence encoding the polypeptide of claim 1 or claim 12 as a scaffold and 
modifying the portions of the nucleotide sequence that encode enzymatic activities, either 
by mutagenesis, inactivation, deletion, insertion, or replacement. 

94. A method for producing polyketides, comprising expressing the modified albicidin 
PKS encoding nucleotide sequence produced by the method of claim 93 in a suitable host 
cell to thereby produce a polyketide different from that produced by said polypeptide. 
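
Several of the claims above define variants by a minimum percentage of sequence identity (at least 60% or at least 70%). The following is a minimal sketch of how such a threshold might be checked for two sequences that have already been aligned to equal length. It assumes identity is counted as matching columns divided by aligned columns; the claims themselves do not fix the alignment method, and the function names `percent_identity` and `meets_threshold` are hypothetical. The example rows are the ACP2 and ACP3 motif segments of XabB as printed in the ACP alignment figure.

```python
def percent_identity(seq_a: str, seq_b: str, gap: str = "-") -> float:
    """Percent identity over the aligned columns of two pre-aligned sequences."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    # Skip columns where both rows carry a gap; they hold no residues to compare.
    columns = [
        (a, b)
        for a, b in zip(seq_a.upper(), seq_b.upper())
        if not (a == gap and b == gap)
    ]
    if not columns:
        raise ValueError("no aligned residues to compare")
    # A column counts as a match only when both rows show the same residue.
    matches = sum(1 for a, b in columns if a == b and a != gap)
    return 100.0 * matches / len(columns)


def meets_threshold(seq_a: str, seq_b: str, threshold: float) -> bool:
    """True when the identity of the two aligned sequences reaches the threshold."""
    return percent_identity(seq_a, seq_b) >= threshold


if __name__ == "__main__":
    acp2 = "EYYGVDSIVAIE"  # Xal-XabB (ACP2) segment from the ACP alignment
    acp3 = "ESYGVDSIVIIE"  # Xal-XabB (ACP3) segment from the ACP alignment
    print(f"{percent_identity(acp2, acp3):.1f}%")  # 10 of 12 columns match
    print(meets_threshold(acp2, acp3, 70.0))
```

A real comparison would first align the full-length sequences with a dedicated alignment program; this sketch only covers the counting step.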
P1F1 

 967 GGCGATGCGC GCACACTGCA GGTCCATCAC GCCACCTCCA GCAGGGTGTC 
     CCGCTACGCG CGTGTGACGT CCAGGTAGTG CGGTGGAGGT CGTCCCACAG 
     A I R A C Q L D M M   RBS 
     xabA <- | 

1017 ATACACGGCC AGCGGATGCT GCAGGTTTTC CACTGGCAGG GCCACTGGCT 
     TATGTGCCGG TCGCCTACGA CGTCCAAAAG GTGACCGTCC CGGTGACCGA 

     -35 (P xabB)   -10 (P xabB) 

1067 GTCGTAAGGG AAGCGGTGCC TTGAGCGCCG GTGCGGACAG TATAACGACA 
     CAGCATTCCC TTCGCCACGG AACTCGCGGC CACGCCTGTC ATATTGCTGT 

     -10 (P xabA) 

1117 CGTTCCTTGG CCAAGCGCAC TGTCGGCACG GCCTTGCTGA TGCCGCCCAT 
     GCAAGGAACC GGTTCGCGTG ACAGCCGTGC CGGAACGACT ACGGCGGGTA 
     -35 (P xabA) 

1167 GTAGCCGCGC GCCTGGATCT CGCGTAGTAG CACCACGCTG GCCGGGATCC 
     CATCGGCGCG CGGACCTAGA GCGCATCATC GTGGTGCGAC CGGCCCTAGG 

     P1R 

     RBS | -> xabB 

1217 ATCGAGGGCG CGCTTGCCCA ATGCGCTCAT GCAGATAACT CTTGTAGCCG 
     TAGCTCCCGC GCGAACGGGT TACGCGAGTA CGTCTATTGA GAACTACGGC 
     M P N A L M Q I T L V A 
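
The figure above prints each upper strand with its complementary strand directly beneath it. As an illustrative sketch (the helper names `complement`, `reverse_complement` and `strands_pair` are hypothetical and not part of the specification), the pairing can be checked base by base with standard Watson-Crick rules. The example uses the line numbered 967, with the spacing of the printed figure ignored.

```python
# Watson-Crick pairing: A<->T and C<->G.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def complement(strand: str) -> str:
    """Base-by-base complement, keeping the left-to-right order used in the figure."""
    return "".join(strand.split()).upper().translate(COMPLEMENT)


def reverse_complement(strand: str) -> str:
    """Complement read in the opposite direction (the other strand, 5' to 3')."""
    return complement(strand)[::-1]


def strands_pair(upper: str, lower: str) -> bool:
    """True when the lower line is the exact complement of the upper line."""
    lower_clean = "".join(lower.split()).upper()
    return complement(upper) == lower_clean


if __name__ == "__main__":
    upper_967 = "GGCGATGCGC GCACACTGCA GGTCCATCAC GCCACCTCCA GCAGGGTGTC"
    lower_967 = "CCGCTACGCG CGTGTGACGT CCAGGTAGTG CGGT GGAGGT CGTCCCACAG"
    print(strands_pair(upper_967, lower_967))
    # ATCGAT (the ClaI recognition sequence) is its own reverse complement.
    print(reverse_complement("ATCGAT"))
```

The same check can be used to restore a lower-strand segment that is illegible in a scanned copy, since it is fully determined by the upper strand.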
(i). AL 

TSGSSGESKGILLSH--GYFRTGDL   Xal-XabB(AL) 
TGGTTGVAKGAMLTH--GWMATGDI   Hin-LCFA 
TSGSTGTPKAVMLNH--GWFETGDL   Bsu-PksJ 
SSGSTGDPKGVMLTH--GWVKTGDL   Bsu-MycA(AL) 
SSGTTGLPKGVMLTH--GWLHTGDI   Pcr-ComL2 
TSGTTGRPKGWSAQ--GWYRTGDL    Sma-FkbB(AL) 
TSGTTGRPKGWSTQ--GWFRTGDL    Ame-RifA(AL) 
TSGTTGTPKGVLSTQ--GWYRTGDL   Shy-RapA(AL) 



(ii). KS 

GPSEVINSACSSSLVAL--VELHGTGTSL--ALGHLGAAAG   Xal-XabB (KS1) 
GPSLAVDTACSASLTAI--IEAHGTGTVL--NIGHAESAAG   Xal-XabB (KS2) 
GPSLFVHTNCSSSLSAL--VEAHGTGTLL--NLGHLDTVAG   Mxa-Ta1 
GPAVTVDTACSSSLVAV--IEAHGTGTKL--NIGHLFEAAG   Bsu-MycA 
GPAVTVDTACSSSLVAL--VEAHGTGTRL--NIGHAQAAAG   Ser-EryA1 
GPAMTVDTACSSGLTAL--VEAHGTGTRL--NIGHTQAAAG   Ser-EryA3 
GPSVLVDTACSGGLTAL--VECHGTGTQA--NIGHLEGASG   Che-PKS1 
GPSLAVDTACSSSLTAI--LEAHGTGTAL--NXGHCESAAG   Bsu-PksM 
GPSVAVDTACSSSLVAI--VEAHGTGTLL--NLGHTEAAAG   Mtu-PpsA 
GPSLTIDTACSSSLMAL--VEAHGTGTKV--NMGHPEPASG   Chick-FAS 
GPSIALDTACSSSLLAL--IEAHGTGTKV--NMGHPEPASG   Rat-FAS 

* 
(Active site cysteine) (Active site histidine) 



(iii). KR 

VYWIGGAGGLGEVLSEHLIRTYD.AQLIWIGR    Xal-XabB 
VYVISGGTGALARLFVAEIGKRATRATVILVAR   Mxa-Ta1 
TVLVTGGTGGVGGQIARWLARRG.APHLLLVSR   Ser-EryA1 
TVLVTGGTGGIGAHLARWLARSG.AEHLVLLGR   Ser-EryA3 
SYLLVGGVGGLGSATALAMSTRG.ARHLLLINR   Che-PKS1 
SYIITGGLGGLGLFFASKLAAAG.CGRIVLTAR   Mtu-MAS 
SYIITGGLGGFGLELAQWLIERG.AQKLVLTSR   Chick-FAS 
SYIITGGLGGFGLELARWLVLRG.AQRLVLTSR   Rat-FAS 



(iv). ACP 

CELALDSLQCVR   Xal-XabB (ACP1) 
EYYGVDSIVAIE   Xal-XabB (ACP2) 
ESYGVDSIVIIE   Xal-XabB (ACP3) 
IGFGLDSIMLTQ   Bsu-MycA 
ERYGIDSIIITQ   Mxa-Ta1 
AELGVDSLSALE   Ser-EryA1 
QDYGIDSLVAVE   Che-PKS1 
IEYGLDSLGMLE   Mtu-MAS 
ADLGLDSLMGVE   Chick-FAS 
ADLGLDSLMGVE   Rat-FAS 

* (Active site serine) 
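
The alignments above mark active-site residues that are invariant across all rows. A minimal sketch of how such invariant columns might be located in a gap-free alignment block is given below. The function name `conserved_columns` is hypothetical, and the rows are the ACP segments as printed in panel (iv), with stray spacing from the scanned figure removed.

```python
def conserved_columns(rows: list[str]) -> list[int]:
    """Return 0-based indices of columns that hold the same residue in every row."""
    # Drop any whitespace introduced by scanning before comparing columns.
    cleaned = ["".join(row.split()).upper() for row in rows]
    width = len(cleaned[0])
    if any(len(row) != width for row in cleaned):
        raise ValueError("all rows must have the same aligned length")
    return [i for i in range(width) if len({row[i] for row in cleaned}) == 1]


ACP_ROWS = [
    "CELALDSLQCVR",  # Xal-XabB (ACP1)
    "EYYGVDSIVAIE",  # Xal-XabB (ACP2)
    "ESYGVDSIVIIE",  # Xal-XabB (ACP3)
    "IGFGLDSIMLTQ",  # Bsu-MycA
    "ERYGIDSIIITQ",  # Mxa-Ta1
    "AELGVDSLSALE",  # Ser-EryA1
    "QDYGIDSLVAVE",  # Che-PKS1
    "IEYGLDSLGMLE",  # Mtu-MAS
    "ADLGLDSLMGVE",  # Chick-FAS
    "ADLGLDSLMGVE",  # Rat-FAS
]

if __name__ == "__main__":
    columns = conserved_columns(ACP_ROWS)
    print(columns)
    print("".join(ACP_ROWS[0][i] for i in columns))
```

For these ten rows the only invariant columns are the adjacent aspartate and serine, the latter being the residue marked as the active-site serine in the figure.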
A. X. albilineans XabB (4801 aa) 

N-terminus 
AL ACP1 KS1 KR ACP2 ACP3 KS2 [remaining domain labels illegible] 

D. M. xanthus Ta1 (2392 aa) 

KS KR ACP KS 

E. B. subtilis PksorfX6 (4447 aa) 
1054 TTCCCGCCCGAATAGGCGCAGGAAGCCAATAAGTATGGCAGCGCCCTTGACCAATGACAAGCTCATGCACCCAGGACGCCC 

1135 GCTCTGCTCCGCGTCGTCCATCGCCATTGCGCCCCTCCCCGACCCCAAGCATCGACCAAAGGACCGAATGCGGCGGGTAGG 

1216 CGCGACTCTGCGACACTAGCGCAATGTT£TC^ 

-10 RBS 

M P N A V S 

1297 ACCGATGCAGGGCGCGCGGGGACTCCCGCAGCCGCAAGCGATGAACCCAGGGTTGCCGAGCGTCGGCGGCTTGAGCGCAGG 

PMQGARGLPQPQAMNPGLPSVGGLSAG 32 

1378 CCAGCCATTGCAGTTGTCGTTAGCACCGGAACTGCAGGCAGCCGCGCGCAGTGCCCACCGCCATCTGCTCGACGACGGCAC 

QPLQLSLAPELQAAARSAHRHLLDDGT 59 

1459 GGCGCTTTACCTGCTGGCGTTCGATACCGCGCAATTCGACCC^ 

ALYLLAFDTAQFDPGAFAAMAIARPDS 86 

1540 CATCGCCCGCAGCGTGCGCAAGCGTCAGGCCGAGTTCCTGTTCGGCCGTCTGGCCGCGCGACTGGCGCTGCAAGAGGTGCT 

IARSVRKRQAEFLFGRLAARLALQEVL 113 

1621 GGGACCTGCGCAAGCGCAGGCAGATATTGCAATCGGCGCGACGCGCGCGCCCTGCT 

GPAQAQADIAIGATRAPCWPAGSLGSI 140 

1702 TTCCCATTGCGAGGACTACGCGGCCGCCATCGCCATGGCGGCCGGCACCCGCCACGGCGTCGGCA 

SHCEDYAAAIAMAAGTRHG[VGID]LERP 167 

1783 AATCAC^CCCGCGGCGCGCGCGGCGTTGCTGAGCATCGCA^ 

ITPAARAALLSIAIDADEAARLAKAAD 194 

VTn5 

1864 CGCGCAGTGGCCGCAAGACCTGCTGCTGACCGCACT ATTTT CGGC 

AQWPQDLLLTAL[FS]A[KES]LF0AAYSAV 221 

1945 CGGACGCTACTTCGACTTCAGCGCGGCACGCCTGTGCGGCATCGACCTGGCACGGCAATGCCTGCATCTGCGCCTGACCGA 

GRYFDFSAARLCGIDLARQCLHLRLTE 248 

2026 GACACTCTGCGCGCAATTCGTGGCCGGGCAAGTGTGCGAGGTCGGCTTCGCGCGCCTACCACCGGACCTGGTGCTCACCCA 

TLCAQFVAGQVCEVGFARLPPDLVLTH 275 

2107 CTACGCCTGGTGAGCACGCGGACAGTCGAACCCGCCAACGCCZAACGGCACTCAA GACGTGGCG TG CGCCGCGTC GGTCGTG 

YAW* 278 

2188 AAG CTCTC CCCGCAGCCGCACTCGGCGGTGGCATTGGGATTGC^^ 

2269 TCGATTTCGGTGCCATCGACCAACTGCAGACTGGCGGCATCGACATAAATCCGCACTCCGTCCTGCTCGAACACCGCATCG 
2350 TCCGCGCGTGCCTCGTGCGCCAGATCGGTGACATGGCCCCAACCGGAACAGCCTGTGCGTACCACCCCGAAACGTAGACCC 
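
In the figure above each nucleotide line is followed by the amino acids it encodes. As a hedged illustration, the sketch below translates one printed line with the standard genetic code so that the printed translation can be cross-checked. The function name `translate` and the choice of reading-frame offset are assumptions made for this example; the input is the line numbered 1378, whose first base completes a codon begun on the preceding line.

```python
from itertools import product

# Standard genetic code, listed in the conventional T, C, A, G codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): residue
    for codon, residue in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str, frame: int = 0) -> str:
    """Translate complete codons starting at the given 0-based offset."""
    dna = "".join(dna.split()).upper()
    residues = []
    for i in range(frame, len(dna) - 2, 3):
        # Codons containing unreadable characters are reported as X.
        residues.append(CODON_TABLE.get(dna[i:i + 3], "X"))
    return "".join(residues)


LINE_1378 = (
    "CCAGCCATTGCAGTTGTCGTTAGCACCGGAACTGCAGGCAGCCGCGCGCAGTGCCCACCG"
    "CCATCTGCTCGACGACGGCAC"
)

if __name__ == "__main__":
    # Offset 1: the first base belongs to a codon started on the previous line.
    print(translate(LINE_1378, frame=1))
```

The output covers only the codons that lie wholly within the line, so it is one residue shorter than the printed translation, whose final residue is completed on the next line.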
+1 ClaI RBS (A3F) 

xabB stop codon 

TAGCAAAAGCCGGCCGCCGTCACCCGTTCATCGATAGCGA 105 

xabC HDSALFTSAFTFD1FYTT 

V H 20 y. 



GCCTACTATCGCACTGCCGCAGTCAAGGCGGCGATCGAACTGGGGCTATTCGATGTG 
GCGGC 225 

AYYRTAAVKAAIELGLFDVVGQQGRTPAAIAEACQASPRG 60 

ClaI 

ATTCGCATCCTTTGCTATTACCTAG TATCGAT CGGTTTTCTA^ 
TGGGT 345 

IRILCYYLVSIGFLRRNGGLFYIDRNMAMYLDRSSPGYLG 100 

GGCAGC&TCAAGTTCCTGCTCTCGCCCTACATCATGAGCGCCTTCACCGATC 
ACCCG 465 

GSIKPLLSPYrMSAFTDLTAVVRTGKIHLAQDGVVAPDH 
P 140 

CAGTGGGTGGAATTIUCACGCGCGATGGCACCGA^ 

CCGGC 585 

Q H V E FARAMAPHMALPSALIAHMVSLPADRPI R [V L D V A| 

l A G 1 180 

E 1 Motif 

CACGGCCTGTTCGGCATCGCCTTCGCGCAGCGCTTCCGCCAGGCTGAAGTGAGCTrCCT^ 
CCGAG 705 

fr— 3 LFGIAFAQRFRQABVSP1DWDKVLDVARENAQAAKV 
A B 220 

(IR) Hindi 
mirjTzn n-i ■ i i - c i- rJ CCGC^^CGC^TTCGACCTCGATTACGGCAGCGGCTACGA^ 
TGGCT 825 

RARFLPGNAPDLDYG |SGYDVILL| TNPLHHPDEVDGERI 
L A 260 

EcoRI Motif II 

AAGACGCGCGATGCGCTGAACGACGACGGCATGGTGATCACTT^ 
CCACC 945 

K T R D A [LHDDGMVITj FEFIADEERSSPFLAATFSMMMLG 
T T 300 

Motif III ClaI 

crnnrvznrmfi a rrrpn* & c. APr^ATAGCGftTCTGGAAACGATGTTTCGGCATGCCGGCTTCGGCCACGTGGAACTAA AATCGATA CCG CCGGCCi' HjCltjAAAGTGGTGGTTTCCC 
GCAAG 1065 

CGTTC 

PAGESYTYSDLERMFRHAQFGHVELKSIPPALLKVVVSR 
K 340 

AGGGCCCCATAATGATCGAATCGGCGACATTCCCTGTGGCGAAAACCGAG 
CCTGC 1185 

TCCCGGGGTATTACT (A3R) 

RAP* 

343 

TCGGTATACGCATGATCGAGATCGGCTCGGACTATCTGGTCT^^ 
CGAGG 1305 

Not I 

CTACCGGCAGCATGGCGGCCTCCATGTGCGTGCCGGCCGGCCAACGTTGCCTTGGCCT 
GCGGC 1425 

Jfcol 

CGCTGCACATAGGGGCCTTGACCCAGGTATGGCAGATGCGCATCT 
1515 
Xal-XabC 

174 VLDVAAGHG 
173 FVDLGGARG 
331 IADLGGGDG 
64 VLEIGTFTG 
63 TLEVGVFTG 
85 VLDVGFGLG 
183 VLDVGGGKG 
208 WDIGGADG 
106 VLEVGCGMG 
71 VLDVGCGSG 

Motif I 

236 SGYDVILL 
234 PRADVFIV 
393 TGYDAYLF 
135 GAFDIVFV 
134 GTFDLAFI 
149 ETFDRVTS 
254 RKADAIIL 
269 GGGDLYVL 
155 VQGDAEEL 
124 GSFDAAWA 

Motif II 

267 ALNDDGMVIT 
263 ALTPGGAVLV 
423 IGDDDARLLI 
159 LVRPGGLVAI 
158 LVRPGGLIIL 
178 VLKPGGVLAI 
273 ALEPGGRIIil 
298 AMPAHARLLV 
194 ALRRGGALSH 
151 VLRPGGRLAV 

Motif III 
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